WORLD INTELLECTUAL PROPERTY ORGANIZATION 
International Bureau 




per 

INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCI) 



(51) International Patent Classification 5 : 

A61K39/04, 02N 15/74, 1/21 



Al 



(11) International Publication Number: 
(43) International Publication Date: 



WO 93/07897 
29 April 1993 (29.04.93) 



(21) International Application Number: PCT/US92/09075 

(22) International Filing Date: 21 October 1992 (21.10.92) 



(30) Priority data: 
780,261 



21 October 1991 (21.10.91) US 



(71) Applicant: MEDIMMUNE, INC [US/US]; 35 West Wat- 

kins Mill Road, Gaithersburg, MD 20878 (US). 

(72) Inventor: STOVER, Charles, KL ; 15105 Timberiake Drive, 

Silver Spring, MD 20904 (US). 

(74) Agents: OLSTEIN, Elliot, M. et al.; Carella, Byrne, Bain, 
GilfiUan, Cecchi & Stewart , 6 Becker Farm Road, Rose- 
land, NJ 07068 (US). 



(81) Designated States: AU, CA, JP, European patent (AT, BE, 
CH, DE, DK, ES, FR, GB, GR, IE, IT, LU, MC, NL, 
SE). 



Published 

With international search report 



(54) Titie: BACTERIAL EXPRESSION VECTORS CONTAINING DNA ENCODING SECRETION SIGNALS OF LIP- 
OPROTEINS 

(57) Abstract 

An expression vector for expressing a protein or polypeptide in a bacterium, which comprises a first DNA sequence encod- 
ing at least a secretion signal of a lipoprotein, and a second DNA sequence encoding a protein or fragment thereof, or polypep- 
tide or peptide heterologous to the bacterium which expresses the protein or fragment thereof, or polypeptide or peptide. The bac- 
terium expresses a fusion protein of a lipoprotein or lipoprotein segment and the protein or fragment thereof, or polypeptide or 
peptide heterologous to the bacterium which expresses the protein or fragment thereof, or polypeptide or peptide. Such expres- 
sion vectors increase the immunogenicrty of the protein or fragment thereof, or polypeptide or peptide by enabling the protein or 
fragment thereof, or polypeptide or peptide to be expressed on the surface of the bacterium. Bacteria which may be transformed 
with the expression vector include mycobacteria such as BCG. The expression vectors of the present invention may be employed 
in the formation of live bacterial vaccines against Lyme disease wherein the bacteria express a surface protein of Borrdia burg- 
dorferi, the causative agent of Lyme disease. 
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BACTERIAL EXPRESSION VECTORS 
CONTAINING DNA ENCODING SECRETION SIGNALS 
OF LIPOPROTEINS 

This application is a continuation-in-part of application 
Serial No. 780, 261, filed October 21, 1991. 

This invention relates to expression vectors for expressing 
a protein in a bacterium, such as for example, a mycobacterium. 
More particularly, this invention relates to expression vectors 
for expressing and secreting proteins which are heterologous to 
the bacterium which expresses such proteins wherein such vectors 
further include DNA encoding at least the secretion signals of 
lipoproteins designed to achieve lipid acylation and surface 
expression of heterologous proteins. 

Certain mycobacteria represent major pathogens of man and 
animals. For example, tuberculosis is generally caused in humans 
by Mycob acterium tuberculosis . and in cattle by Mycoba c terium 
bovi s , which may also be transmitted to humans and other animals. 
Mycoba cteria leprae is the causative agent of leprosy. 
M. tuberc ulosis and mycobacteria of the avium- intracellulare- 
scrofulaceum group (MAIS group) represent major opportunistic 
pathogens of patients with acquired immune deficiency syndrome 
(AIDS). M . ps eudo tube rcu 1 o s i s is a major pathogen of cattle. 

On the other hand, Bacille Calmette-Guerin, or BCG, an 
avimlent strain of M.bovis , is widely used in human vaccines, 
and in particular is used as a live vaccine, which is protective 
against tuberculosis. BCG is the only childhood vaccine which is. 
currently given at birth, has a very low incidence of adverse 
effects, and can be used repeatedly in an individual. (eg., in 
multiple forms). In addition, BCG and other mycobacteria (eg., 
M. sm eqm atis ) , employed in vaccines, have adjuvant properties 
among the best currently known and, therefore, stimulate a 
recipient's immune system to respond to antigens with great 
effectiveness . 
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It has been suggested by Jacobs, et. al, Nature . Vol. 327, 
No. 6122, pgs. 532-535 (June 11, 1987), that BCG could be used as 
a host for -the construction of recombinant vaccines. In other 
words, it was suggested to take an existing vaccine (in this case 
against tuberculosis) and expand its protective repertoire 
through the introduction of one or more genes from other 
pathogens. 

Transformation, the process whereby naked DNA is introduced 
into bacterial cells, has been carried out successfully in 
mycobacteria. Jacobs, et al (1987), as hereinabove cited, have 
described transformation of mycobacteria by electroporation. 
Electroportation can give from 10 5 to 10 6 transformants per ug of 
plasmid DNA and such plasmid DNA 1 s may carry genes for resistance 
to antibiotic markers such as kanamycin, Snapper, et al, PNAS, 
Vol. 85, pgs. 6987-6991 (September, 1988); to allow for selection 
of transformed cells from non- transformed cells. 

Jacobs, et al (1987) and Snapper, et al (1988) have also 
described the use of cloning vehicles such as plasmids and 
bacteriophages, for carrying genes of interest into mycobacteria. 

Lee, et al., .PNAS, Vol. 88, pgs. 3111-3115 (April 1991), 
descriho vectors which employ DNA encoding a mycobacterial phage 
int«grase and phage attachment site to effect site-specific 
integration into a mycobacterial chromosome. Such vectors permit 
stable integration of vectorss encoding foreign antigen genes 
into a mycobacterial chromosome. 

Stover, et al., (Nature, Vol. 351, pgs. 456-460 (June 6, 
1991)) describe integrative and extrachromosomal vectors 
employing mycobacterial HSP60 and HSP70 promoters to express 
foreign antigens cytoplasmically in recombinant BCG. Stover, et 
al. demonstrated that recombinant BCG expressing foreign antigens 
with these vectors could be used as immunogens to generate 
humoral and cellular immune responses to the foreign antigens. 
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Combination of the above-mentioned techniques , along with 
standard tools of molecular cloning (e.g., use of restriction 
enzymes, etc.) allows the cloning of genes of interest into 
vectors and introduction of such genes into mycobacteria. 

In accordance with an aspect of the present invention, there 
is provided an expression vector for expressing a protein or 
polypeptide or peptide in a bacterium. The expression vector 
comprises a first DNA sequence encoding at least a secretion 
signal of a lipoprotein; and preferably further comprises a 
second DNA sequence encoding a protein or fragment thereof or 
polypeptide or peptide heterologous to the bacterium which 
expresses the protein or fragment thereof, or polypeptide or 
peptide, whereby the bacterium expresses a fusion protein of a 
lipoprotein or lipoprotein segment (which may include the 
secretion signal), and the protein or fragment thereof, or 
polypeptide or peptide heterologous to the bacterium which 
expresses the protein or polypeptide or peptide. 

Such an expression vector may be employed in any of a 
variety of bacteria which may be employed in vaccines, including 
live vaccines. In particular, in one embodiment, the bacterium 
is a mycobactrium such as, but not limited to, Mycobac terium 
bovis - BCG, M. sm eq matis , M. avium , M.phlei , M. f ortuitium , M. lufu, 
M. par atuberculosis , M.habana , M. scrof alaceum , M. intracellular , 
and M . vaccae . 

In one embodiment, the mycobacterium is M.bovis -BCG. 

Although the scope of the present invention is not to be 
limited to any theoretical reasoning, it is believed that the 
signal sequence of the lipoprotein enables the expressed 
recombinant fusion protein to be modified such that the protein 
is expressed at the surface of the bacterium as a chimeric 
lipoprotein. For example, the fusion protein may include 
processing or recognition site(s) for signal peptidase II in the 
signal sequence portion, which enables lipid acylation of the 
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fusion protein. Such lipid acylation of the fusion protein may 
enhance the immunogenicity of the heterologous protein or 
fragment thereof, or polypeptide or peptide portion of the fusion 
protein. Also, the signal sequence enables the fusion protein to 
be expressed at and anchored to the surface of the bacterium, 
thus making the heterologous protein or polypeptide more 
accessible, which also may increase the immunogenicity of the 
protein or fragment thereof, or polypeptide or peptide. Also, 
because such fusion proteins may be expressed on the surface of 
the bacterium, such expression or secretion of the fusion protein 
will permit the expression of antigens which may be lethal if 
expressed or maintained cytoplasmically in the bacterium. It is 
to be understood that the heterologous protein or fragment 
thereof, or polypeptide or peptide may itself be a lipoprotein, 
such as the OspA antigen of Borrella burgdorferi, which is 
hereinafter discussed, or a non- lipoprotein, such as, for 
example, HIV antigens, tetanus toxoids, diphtheria toxoids, 
cholera toxoids, pertussis toxoids, and malarial antigens. Thus, 
the expression vectors of the present invention enable the 
genetic engineering of a non-lipoprotein moiety which may become 
anchored to the surface of a bacterium. 

Thus, the expression vectors enable the expression of 
heterologous genes or gene segments (which originally encoded 
non- lipoproteins) as chimeric surface lipoproteins. This is 
accomplished by gene fusion of the foreign genes or gene segments 
to vector encoded genes or gene segments encoding lipoproteins or 
lipoprotein signal peptides, respectively. 

In one embodiment, the first DNA sequence encodes at least a 
secretion signal of a mycobacterial lipoprotein. The 
mycobacterial lipoprotein may, in one embodiment, be an 
M. tuber culosis lipoprotein. The M. tuberc ulosis lipoprotein may 
be selected from the group consisting of the H. tuberculosis 19 
kda antigen and the M. tuberculosis 38 kda antigen. 
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Other lipoproteins, of which at least the secretion signal 
may be encoded by the first DNA sequence include , but are not 
limited to, Braun's lipoprotein of E. coli , S. marcescens . E. 
amylosora, M. morganii , and P. mirabilis , the TraT protein of E. 
coll and Salmonella : the penicillinase (PenP) protein of B. 
licheni formis and B. cereus and S. aureus ; pullulanase proteins 
of Kleb siella pneumoniae and Klebsiella aerogenese : E. coli 
lipoproteins lpp-28, Pal, RplA, RplB, OsmB, NlpB, and Orll7; 
chitobiase protein of V. harsevl ; the p-1, 4-endoglucanase protein 
of Pseudo monas solanacearum , the Pal and Pep proteins of H. 
influenzae; the OprI protein of P. aeruginosa ; the MalX and Ami A 
proteins of S. pneumoniae ; the 34 kda antigen and TpmA protein of 
Treponem a pallidu m; the P37 protein of Mycoplasma hvorhinis : and 
the 17 kda antigen of Rickettsia rickettsii . It is to be 
understood, however, that the scope of the present invention is 
not to be limited to secretion signals of any particular 
lipoprotein or lipoproteins. 

In one embodiment, the first DNA sequence may further 
include DNA which encodes all or a portion of the lipoprotein. 
Thus, in such an embodiment, the fusion protein which is 
expressed by the bacterium is a fusion protein of the secretion 
signal of the lipoprotein, all or a portion of the lipoprotein, 
and the heterologous protein or polypeptide or peptide. 

The first and second DNA sequences are under the control of 
a suitable promoter. In one embodiment, the promoter may be the 
19 kda antigen promoter or the 38 kda antigen promoter of 
M. tube rculosis if DNA encoding the secretion signal of one of 
these antigens is employed. Alternatively, the promoter may be a 
mycobacterial promoter other than the 19 kda and 38 kda 
M.tube rculosus antigen promoters, or a mycobacteriophage 
promoter • 

Mycobacterial and mycobacteriophage promoters which may be 
employed include, but are not limited to, mycobacterial promoters 
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such as the BCG HSP60 and HSP70 promoters; the mycobactin 
promoter from gf. tuberculosis and BCG; the mycobacterial 14 kda 
and 12 kda antigen promoters; the mycobacterial «-antigen 
promoter from' M^nberculosis or BCG; the MBP-70 promoter, the 
mycobacterial 45 kda antigen promoter from ^ tuberculosis or BCG; 
the superoxide dismutase promoter; the mycobacterial asd 
promoter, and mycobacteriophage promoters such as the Bxbl, Bxb2, 
Bxb3, LI, L5, D29 and TM4 promoters. In one embodiment, the 
promoter is a mycobacterial heat shock protein promoter such as 
HSP60 or HSP70. 

Example of expression vectors including the mycobacterial 
promoters and mycobacteriorphage promoters hereinabove described 
are further described in application Serial No. 642,017, filed 
January 16, 1991, which is a continuation of application Serial 
No. 552,828, filed July 16, 1990, now abandoned. The contents of 
application Serial No. 642,017 are hereby incorporated by 
reference . 

In a preferred embodiment, the transcription initiation 
site, the ribosomal binding site, and the start codon, which 
provides for the initiation of the translation of mRNA, are each 
of mycobacterial origin. The stop codon, which stops translation 
of mRNA, thereby terminating synthesis of the heterologous 
protein, and the transcription termination site, may be of 
mycobacterial origin, or of other bacterial origin, or may be 
synthetic in nature, or such stop codon and transcription 
termination site may be those of the DNA encoding the 
heterologous protein or polypeptide. 

Preferably, the mycobacterial promoter is a BCG promoter, 
and the mycobacterium is BCG. 

Heterologous proteins or polypeptides which may be encoded 
by the second DNA sequence include, but are not limited to, 
antigens, anti-tumor agents, enzymes, lymphokines, pharmacologic 
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agents, immunopotentiators, and reporter molecules of interest in 
a diagnostic context. 

Antigens which may be encoded include, but are not limited 
to, My cobacterium leprae antigens; Mycobacterium tuberculosis 
antigens; Rickettsia antigens; Chlamydia antigens; Coxiella 
antigens; malaria sporozoite and merozoite proteins, such as the 
circumsporozoite protein from Plasmodium berghei sporozoites; 
diphtheria toxoids; tetanus toxoids; Clostridium antigens; 
Leishmania antigens; Salmonella antigens; E-coli antigens; 
Listeria antigens; Borrelia antigens, including the OspA and OspB 
antigens of Borrelia burgdorferi ; Franciscella antigens; Yersinia 
antigens; Mycobacterium af ricanum antigens; Mycobacterium 
intrace llulare antigens; Mycobacterium avium antigens; Trepon ema 
antigens; Schistosome antigens; Filaria antigens; Pertussis 
antigens; Staphylococcus antigens; Herpes virus antigens; 
influenza and parainfluenza virus antigens; measles virus 
antigens; Bordatella antigens; Hemophilus antigens; Streptococcus 
antigens, including the M protein of S. pyogenes and pneumococcus 
antigens such as Streptococcus pneumoni ae antigens; mumps virus 
antigens; hepatitis virus antigens; Shigella antigens; Neisseria 
ant j gens; rabies antigens; polio virus antigens; Rift Valley 
Fever virus antigens; dengue virus antigens; measles virus 
antigens; rotavirus antigens; Human Immunodeficiency. Virus (HIV) 
antigens, including the gag, pol, and env proteins; respiratory 
syncytial virus (RSV) antigens; snake venom antigens; human tumor 
antigens; and Vibrio cholera antigens. Enzymes which may be 
encoded include, but are not limited to, steroid enzymes. 

In one embodiment, the second DNA sequence encodes at least 
one protein or polypeptide or fragment or derivative thereof 
which includes an epitope which is recognized by cytotoxic T 
lymphocytes induced by an HIV protein or fragment or derivative 
thereof. The at least one DNA sequence may encode an HIV protein 
or fragment or derivative thereof. HIV proteins or polypeptides 
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which may be encoded by the at least one DNA sequence includes 
but are not limited to, HIV-I-gp 120; HIV-r-gp 41; HIV-I-gp 160 
HIV-I-pol; HIV-I-nef; HIV-I-tat; HIV-1-rev; HIV-I-vif; HIV-I-vpr 
HIV-I-vpu; HIV-I-gag; HIV-2gp 120; HIV-2-gp 160; HIV-2-gp 41 
HIV-2-gag; HIV-2-pol; HIV-2-nef; HIV-2-tat; HIV-2-rev; HIV-2-vif 
HIV-2-vpr; HIV-2-vpu; and HIV-2-vpx. 

Anti-tumor agents which may be encoded include, but are not 
limited to, interf eron-o , interferon-^, or interferon- y, and 
tumor necrosis factor, or TNF- Lymphokines which may be encoded 
include, but are not limited to, inter leukins 1 through 8. 

It is also contemplated that the heterologous protein or 
polypeptide may be a reporter molecule or selectable marker. 

Reporter molecules which may be encoded include-, but are not 
limited to, lucif erase, B-galactosidase, B-glucuronidase, and 
catechol dehydrogenase - 

Other peptides or proteins which may be encoded include, but 
are not limited to, those which encode for stress proteins, which 
can be administered to evoke an immune response or to induce 
tolerance in an autoimmune disease (e.g., rheumatoid arthritis). 

Selectable markers which may be encoded include, but are not 
limited to, the p-galactosidase marker, the kanamycin resistance 
marker, the chloroamphenicol resistance marker, the neomycin 
resistance marker, and the hygromycin resistance marker, 
bacteriophage resistance markers, or genes encoding enzymes 
involved in the synthesis of nutritional elements, such as amino 
acids. 

In accordance with one embodiment, the vector further 
includes a mycobacterial origin of replication. 

In accordance with another embodiment, the vector may be a 
plasmid. The plasmid may be a non-shuttle plasmid, or may be a 
shuttle plasmid which further includes a bacterial origin of 
replication such as an E.coli origin of replication, a Bacillus 
origin of replication, a Staphylococcus origin of replication, a 
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Streptomvces origin of replication, or a streptococcal origin of 
replication. In one embodiment, the shuttle plasmid includes an 
E. coli origin of replication. 

In accordance with yet another embodiment, the vector may 
further include a multiple cloning site, and the second DNA 
sequence encoding for the heterologous protein is inserted in the 
multiple cloning site. 

In another embodiment, the expression vector may be, for 
example, a temperate shuttle phasraid or a bacterial-mycobacterial 
shuttle plasmid. Each of these vectors may be used to introduce 
the first DNA sequence encoding at least the secretion signal of 
a lipoprotein and a second DNA sequence encoding a protein or 
fragment thereof, or polypeptide or peptide heterologous to the 
mycobacterium which expresses the protein or fragment thereof, or 
polypeptide or peptide stably into mycobacteria, in which the DNA 
seqeunces may be expressed. When a shuttle phasmid, which 
replicates as a plasmid in bacteria and a phage in mycobacteria, 
is employed, Integration of the phasmid, which includes the first 
DNA sequence encoding at least the secretion signal of a 
lipoprotein, and a second DNA sequence endoing a protein or 
fragment thereof, or polypeptide or peptide heterologous to the 
mycobacterium which expresses the protein or fragment thereof, or 
polypeptide or peptide, into the mycobacterial chromosome, occurs 
through site- specific integration. The DNA seqeunces are 
replicated as part of the chromosomal DNA. When a 
bacterial-mycobacterial shuttle plasmid is employed, the DNA 
sequences are stably maintained extrachormosomally in a plasmid. 
Expression of the DNA sequences occur extrachromosomally (e.g., 
episomally). For example, the DNA sequences are cloned into a 
shuttle plasmid and the plasmid is introduced into a 
mycobacterium such as those hereinabove described, wherein the 
plasmid replicates episomally. Examples of such shuttle phasmids 
and bacterial-mycobacterial shuttle plasmids are further 
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described in Application Serial No. 361,944, filed June 5, 1989, 
which is hereby incorporated by reference. 

in addition to the first DNA sequence encoding at least the 
secretion signal of a lipoprotein and the second DNA sequence 
encoding a heterlogous protein or fragment thereof, or 
polypeptide or peptide, and the mycobacterial promoter for 
controlling expression of the DNA encoding the heterologous 
protein or polypeptide, the expression vector may, in one 
embodiment, further include a DNA sequence encoding bacteriophage 
integration into a mycobacterium chromosome. Bacteriophages from 
which the DNA sequence encoding bacteriophage integration into a 
mycobacterium chromosome may be derived include, but are not 
limited to, mycobacteriophages such as but not limited to, the 
L5, LI, Bxbl, and TM4 mycobacteriophages; the lambda phage of 
coli; the toxin phages of Corynebacteria; phages of Actinomycetes 
and Norcardia; the *C31 phage of Streptomyces ; and the P22 phage 
of Salmonella. Preferably, the DNA sequence encodes 
mycobacteriophage integration into a mycobacterium chromosome. 
The DNA sequence which encodes bacteriophage integration into a 
mycobacterium chromosome may include DNA which encodes integrase, 
which is a protein that provides for integration of the vector 
into the mycobacterial chromosome. Preferably, the DNA sequence 
encoding mycobacterial phage integration also includes DNA which 

encodes an attP site. 

The DNA encoding the attP site and the integrase provides 
for an integration event which is referred to as site-specific 
integration. DNA containing the attP site and the integrase gene 
is capable of integrating into a corresponding attB site of a 
mycobacterium chromosome. 

It is to be understood that the exact DNA sequence encoding 
the attP site may vary among different phages, and that the exact 
DNA sequence encoding the attB site may vary among different 
mycobacteria. 
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Examples of DNA which is a phage DNA portion encoding 
bacteriophage integration into a mycobacterlum chromosome are 
further described in Application Serial No. 869,330, filed April 
15, 1992, which is a continuation-in-part of Application Serial 
No. 553,907, filed July 16, 1990, now abandoned. The contents of 
Application Serial No. 869,330 are incorporated by reference. 

The vectors of the present invention may be employed to 
transform bacteria, and in particular, mycobacteria which 
include, but are not limited to, Mycobacterium bovis - BCG, M. 
smeomatis. M. avium , M. phlei , M. fortuitum . M. lufu . M. 
paratuberculosis . M. habana. M. scrof alaceum . M. intracellulare 
and M. vaccae: in particular, such vectors may be employed to 
transform BCG. The transformed mycobacteria thus express the 
heterologous protein, which, as hereinabove stated, may be an 
antigen, which induces an immune response, or a therapeutic 
agent. Thus, the transformed mycobacteria may be employed as 
part of a pharmaceutical composition, such as a vaccine and/or 
therapeutic agent, which includes the transformed mycobacteria, 
and an acceptable pharmaceutical carrier. Acceptable 
pharmaceutical carriers include, but are not limited, to mineral 
oil, alum, synthetic polymers, etc. Vehicles for vaccines and 
therapeutic agents are well known in the art and the selection of 
a suitable vehicle is deemed to be within the scope of those 
skilled in the art from the teachings contained herein. The 
selection of a suitable vehicle is also dependent upon the manner 
in which the vaccine or therapeutic agent is to be administered. 
The vaccine or therapeutic agent may be in the form of an 
injectable dose and may be administered intramuscularly, 
intravenously, orally, intradermally, or by subcutaneous 
administration. 

The mycobacteria are administered in an effective amount. 
In general, the mycobacteria are administered in an amount of 
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from about 1 x 10 5 to about 1 X 10 10 colony forming units (CFU's) 
per dose. 

Other -means for administering the vaccine or therapeutic 
agent should be apparent to those skilled in the art from the 
teachings herein; accordingly, the scope of the invention is not 
to be limited to a particular delivery form. 

As hereinabove noted, the expression vectors of the present 
invention may contain DNA which encodes Borrelia antigen(s), 
including but not limited to surface proteins or antigens of 
Borreli a burgdorferi , the causative agent of Lyme disease. Thus, 
in accordance with an aspect of the present invention, there is 
provided a method of protecting an animal against Lyme disease 
which comprises administering to an animal mycobacteria 
transformed with DNA which includes at least one DNA sequence 
which encodes a protein or polypeptide which elicits antibodies 
against Borrelia burgdorferi . The mycobacteria are administered 
in an amount effective to protect an animal against Lyme disease. 
Such amounts may be those hereinabove described. In one 
embodiment, the at least one DNA sequence encodes a surface 
protein of Borrelia burgdorferi or a fragment or derivative 
thereof. Surface proteins of Borrelia burgedorferi which may be 
encoded by the at least one DNA sequence, include but are not 
limited to, Outer Surface Protein A and Outer Surface Protein B, 
sometimes hereafter referred to as OspA and OspB, respectively. 

The transformed mycobacteria include those hereinabove 
described. In one embodiment, the mycobacteria are of the species 
M.bo vis - BCG. 

The at least one DNA sequence which encodes a protein or 
polypeptide which elicits antibodies against Borrelia 
burgdo rferi , in a preferred embodiment, is contained in a 
mycobacterial expression vector. In one embodiment, the 
mycobacterial expression vector may include a DNA sequence 
encoding at least a secretion signal of a lipoprotein, such as 
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those hereinabove described, and wherein the mycobacterium 
expresses a chimeric fusion protein of the lipoprotein or 
lipoprotein segment (which may include the secretion signal) and 
the protein or polypeptide which elicits antibodies against 
Borr e lia burgdorferi « Such an expression vector enables the 
protein or polypeptide which elicits antibodies against Borrelia 
burg dorferi . to be expressed on the surface of the mycobacterium/ 
whereby the protein or polypeptide becomes more accessible. 

It is also contemplated that, in another embodiment, the 
mycobacterial expression vector may contain DNA which encodes all 
or a portion of a mycobacterial excretion protein, as well as the 
DNA which encodes a protein or polypeptide which elicits 
antibodies against Borrelia burgdorferi . The mycobacterium 
expresses a fusion protein of the mycobacterial excretion protein 
or a portion thereof, and the protein or polypeptide which 
elicits antibodies against Borrelia burgdorferi . Such an 
expression vector enables the protein or polypeptide to be 
excreted from the mycobacterium • Examples of mycobacterial 
excretion proteins which may be encoded, include, but are not 
limited to, the a-antigen of M. tuberculosis and BCG. 

The mycobacterial expression vector, in one embodiment, may 
include a promoter selected from the group consisting of 
mycobacterial promoters and mycobacteriophage promoters, such as 
those hereinabove described, and/or may include a DNA sequence 
encoding bacteriophage integration into a mycobacterium 
chromosome, also as hereinabove described* 

In another embodiment, the mycobacterial expression vector 
may be a plasmid, such as a non-shuttle plasmid or a shuttle 
plasmid which further includes a bacterial origin of replication, 
also as hereinabove described. 

It is also contemplated that the mycobacterial expression 
vector may be a temperate shuttle phasmid or a 
bacterial-mycobacterial shuttle plasmid as hereinabove described. 
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The transformed mycobacteria are employed as part of a 
composition for protecting an animal against Lyme disease. Such 
a composition includes the transformed mycobacteria, and an 
acceptable phararaaceutical carrier such as -those hereinabove 
described. 

The invention will now be further described with respect to 
the following examples; however, the scope of the present 
invention is not intended to be limited thereby. 

Example 1 

A . pnnntruetion of plasmlds including mycobacterial promoter 
expression cassette. 
1. Construction of PYUB125 

Plasmid pAL5000, a plasmid which contains an origin of 
replication of M. fortuitum , and described in Labidi, et al. , 
FEMS M icrobiol. Lett ., Vol. 30, pgs. 221-225 (1985) and in Gene, 
Vol. 71, pgs. 315-321 (1988), is subjected to a partial Sau 3A 
digest, and 5kb fragments are gel purified. A 5kb fragment is 
then ligated to Bam HI digested pU666 (an. E. coli vector 
containing an E. coli origin of replication and also carries 
neomycin- kanamycin resistance, as described in Kieser, et al., 
Gene , Vol. 65, pgs. 83-91 (1988) to form plasmid pYUB12. A 
schematic of the formation of plasmid pYUB12. A schematic of the 
formation of plasmid pYUB12 is shown in Figure 1. pYUB12 and 
pIJ666 were then transformed into K. smeqmatis and BCG. 
Neomycin-resistant transforraants that were only obtained by 
pYUB12 transformation confirmed that pAL5000 conferred autonomous 
replication to pIJ666 in M. Bmeomatis and BCG. 

Shotgun mutagenesis by Snapper, et al (1988, hereinabove 
cited) indicated that no more than half of the pAL5000 plasmid 
was necessary to support plasmid replication in BCG. This 
segment presumably carried open reading frames ORF1 and ORF2, 
identified by Rauzier, et al., Gene , Vol. 71, pgs. 315-321 
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(1988), and also presumably carried a mycobacterial origin of 

replication. pYUB12 is then digested with Hpal and EcoRV, a 2586 

bp carrying this region or segment pALSOOO is removed and ligated 

to PvuII digested pYUB8. Plasmid pYUB8 (a pBR322 derivative) 

R 

includes an E. coli replicon and a kan (aph) gene. Ligation of 
the 2586 bp pYUB12 fragment to PvuII digested pYUB8 results in 
the formation of pYUB53, as depicted in Figure 2. Transformation 
of pYUB53 confirmed that the EcoRV-Hpal fragment, designated 
M.rep, was capable of supporting autonomous replication in BCG. 

Plasmid pYUB53 was then digested with AatI, EcoRV, and PstI 
in order to remove the following restriction sites: 

AatI 5707 

EcoRI 5783 

BamHI 5791 

Sail 5797 

PstI 5803 

PstI 7252 

Sail 7258 

BamHI 7264 

EcoRI 7273 

Clal 7298 

Hindi I I 7304; and 

EcoRV 7460 

Fragment ends are then flushed with T4 DNA polymerase and 
religated to form plasmid pYUB125, construction of which is shown 
in Figure 3. 

2. Elimination of superfluous vector DNA from PYUB125 

792 bases of the tet gene, which had been inactivated by 

prior manipulations, was eliminated by a complete Narl digest, 

gel purification of the 6407 bp fragment, and 

ligation/recirculation, transformation of E. coli strain HBlOl, 

R 

and selection of Kan transformants. The construction of 
resulting plasmid, pMVlOl, is schematically indicated in Figure 
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4, and the DNA sequence of pMVlOl, which includes markings of 
regions which will be deleted, and of mutations, as hereinafter 
described, is shown in Figure 5. 

3. Elimination of undesirabl e restriction sites in aph 

(kan R V gene. 

To facilitate future manipulations, the Hindi 1 1 and Clal 
restriction sites in the aph gene were mutagenized simultaneously 
by polymerase chain reaction (PCR) mutagenesis according to the 
procedure described in Gene, Vol. 77 pgs. 57-59 (1989). The 
bases changed in the aph gene were at the third position of 
codons (wobble bases) within each restriction site and the base 
substitutions made were designed not to change the amino acid 
sequence of the encoded protein. 

Separate PCR reactions of plasmid pMVlOl with primers 
ClaMut-Kan + HindRMut-Kan and HindEWut-Kan + Barn-Kan were 
performed at 94°C (1 min. ) , 50°C (1 min.), and 72°C (1 min. ) for 
25 cycles. The PCR primers had the following base sequences: 

ClaMut-Kan 

CTT GTA TGG GAA GCC CC 
Hi ndRMut-Kan 

GTG AGA ATG GCA AAA GAT TAT GCA TTT CTT TCC AG 
H indFMut-Kan 

GTC TGG AAA GAA ATG CAT AAT CTT TTG CCA TTC TCA CCG G 
B arn-Kan 

CGT AGA GGA TCC ACA GGA CG 
The resulting PCR products were gel purified and mixed and a 
single PCR reaction without primers was performed at 94°C (1 
min.), 72 °C (1 min.) for 10 cycles. Primers ClaMut-Kan and 
Bam-Kan were added and PCR was resumed at 94°C (1 min.), 50°C (1 
min.), and 72°C (2 rain.) for 20 cylces. The resulting PCR 
product (Kan. mut) was digested with BamHI and gel purified. 
Plasmid pMVlOl was digested with Clal and cohesive ends were 
filled in by Klenow + dCTP + dGTP. Klenow was heat inactivated 
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and the digest was further digested with BamHI. The 5232 base 

pair fragment was gel purified and mixed with fragment Kan.mut 

and ligated. The ligation was transformed into E. coli strain 
R 

HB101 and Kan colonies were screened for plasmids .resistant to 
Clal and Hindlll digestion. Such plasmids were designated as 
pMVUO, which is depicted in Figure 4. 

4. Elimination of sequences not necessary for plasmid 

replication in mycobacteria. 

Plasmid pMVUO was resected in separate constructions to 
yield plasmids pMVlll and pMV112. In one construction, pMVUO 
was digested with Narl and Ball, the ends were filled in, and a 
5296 base pair fragment was ligated and recircularized to form 
pMVlll. In another construct, pMVUO was digested with Ndel and 
SplI, the ends were filled in, and a 5763 base pair fragment was 
ligated and recircularized to form pMV112. Schematics of the 
constructions of pMVlll and pMV112 are shown in Figure 6. These 
constructions further eliminated superfluous E. coli vector 
sequences derived from pALSOOO not necessary for mycobacterial 
replication. Cloning was performed in E. coli . Plasmids pMVlll 
and pMV112 were tested for the ability to replicate in M. 
smegmatis. Because both plasmids replicated in M. smeomatis the 
deletions of each plasmid were combined to construct pMV113. 
(Figure 6). 

To construct pMV113, pMVlll was digested with BamHI and 
EcoRI, and a 1071 bp fragment was isolated. pMV112 was digested 
with BamHI and EcoRI, and a 3570 bp fragment was isolated, and 
then ligated to the 1071 bp fragment obtained from pMVlll to form 
pMV113. These constructions thus defined the region of pALSOOO 
necessary for autonomous replication in mycobacteria as no larger 
than 1910 base parls. 

5. Mutagenesis of restriction sites in mycobacterial 

replicon. 
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To facilitate further manipulations of the mycobacterial 
replicon, PGR mutagenesis was performed as above to eliminate the 
Sal I EcoRI, and BglZZ sites located in the open readxng frame 
known as OREl of pALSOOO. PCR mutagenesis was performed at 
wobble bases within each restriction site and the base 
substitutions were designed not to change the amino acid sequence 
of the putative encoded 0RF1 protein. The restriction sites were 
eliminated one at a time for testing in mycobacteria. It was 
possible to eliminate the Sail and EcoRI without alterxng 
replication in m. smeomatis. In one construction PCR mutagenesis 
was performed at EcoRI1071 of P H7113 with primers Eco Nut - M.rep 
and Bam-M.rep to form P MV117, which lacks the EcoRI1071 site. 
Primer Eco Hut - M.rep has the following sequence: 

TCC GTG CAA CGA GTG TCC CGG A; 

and Bam-M.rep has the following sequence: 

CAC CCG TCC TGT GGA TCC TCT AC. 
In another construction, PCR mutagenesis was performed at 
the Sail 1389 site with primer Sal Hut - M.rep and Bam-M.rep to 
form PMV119, which lacks the Sail 1389 site. Primer Sal Mut- 
M.rep has the following sequence: 

TGG CGA CCG CAG TTA CTC AGG CCT. 
PMV117 was then digested with ApaLI and Bglll, and a 3360 bp 
fragment was isolated. P MV119 was digested with ApaLI and Bglll, 
and a 1281 bp fragment was isolated and ligated to the 3360 bp 
fragment isolated from pMV117 to form pMV123. A schematic of the 
constructions of plasmids P MV117, P MV119, and pMV123 is shown in 
Figure 7. Elimination of the Bglll site, however, either by PCR 
mutagenesis or Klenow fill in, eliminated plasmid replication in 
mycobacteria, thus suggesting that the Bglll site is in proximity 
to, or within a sequence necessary for mycobacteria plasmid 

replication. 

fi construction of PMV 200 series vectors. 
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To facilitate manipulations of all the components necessary 

for plasmid replication in E. coll and mycobacteria/ (E. rep. and 

M. rep.) and selection of recombinants (Kan R ) , cassettes of each 

component were constructed for simplified assembly in future 

vectrs and to include a multiple cloning site (MCS) containing 

unique restriction sites and transcription and translation 

terminators. The cassettes were constructed to allow directional 

cloning and assembly into a plasmid where all transcription is 

unidirectional, 
p 

Kan Cassette 

A DNA cassette containing the aph (Kan R ) gene was 
constructed by PCR with primers Kan 5 ' and Kan 3 '. An Spel site 
was added to the 5' end of the PCR primer Kan3 * , resulting in the 
formation of a PCR primer having the following sequence: 

CTC GAC TAG TGA GGT CTG CCT CGT GAA G. 

Bam HI + Nhel sites were added to the 5' end of the primer 
Kan5', resulting in the formation of a PCR primer having the 
following sequence: 

CAG AGG ATC CTT AGC TAG CCA CT GAC GTC GGG G. 

PCR was performed at bases 3375 and 4585 of pMV123, and 
BamHI and Nhel sites were added at base 3159, and an Spel site 
was added at base 4585. Digestion with BamHI and Spel, followed 
by purification resulted in a 1228/2443 Kan R cassette bounded by 
BamHI and Spel cohesive ends with the direction of transcription 
for the aph gene proceeding from BamHI to Spe I. 

E. rep, cassette 

A DNA cassette containing the ColEI replicon of pUC19 was 
constructed by PCR with primers E.rep/Spe and E.rep/Mlu. An Spel 
site was added to the 5* end of PCR primer E.rep/Spe and an Mlul 
site was added to the 5 1 end of PCR primer E.rep./Mlu. The 
resulting primers had the following sequences: 

E. rep. /Spe 

CCA CTA GTT CCA CTG AGC GTC AGA CCC 
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K-T-ep./Mltt 

GAC AAC GCG TTG CGC TCG GTC GTT CGG CTG. 

PGR was performed at bases 713 and 1500 of pUC19, and an 
Mlul site was added to base 713, and a Spel site was added to 
base 1500. Digestion with Mlul and Spel, followed by 
purification resulted in an E.rep. cassette bounded by Spel and 
Mlul cohesive ends with the direction of transcription for RNA I 
and RNA II replication primers proceeding from Spel to Mlul. 

M.rep. cassette 

A DNA cassette containing sequences necessary for plasmid 
replication in mycobacteria was constructed by PCR of pMV123 with 
primers M.rep/Mlu and M. rep/Bam. An Mlul site was added to the 
5' end of PCR primer M.rep/Mlu. A BamHI site was added to the 5' 
end of PCR primer M/rep/Bam. The resulting PCR primers had the 
following base sequences: 

M .rep. /Mlu 

CCA TAC GCG TGA GCC CAC CAG CTC CG 
M.rep. /Bam 

CAC CCG TCC TGT GGA TCC TCT AC 

PCR was performed at bases 134 and 2082 of pMV123. An Mlul 
sited was added to base 2082. Digestion with BamHI and Mlul, 
followed by gel purification resulted in a. 1935 base pair DNA 
cassette bounded by Mlul and BamHI cohesive ends with the 
direction of transcription for the pAL5000 0RF1 and 0RF2 genes 
proceeding from Mlul to Bam HI. 

The Kan R , E.rep, and M.rep PCR cassettes were then mixed in 
equimolar concentrations and ligated, and then transformed in E± 
coli strain HB101 for selection of Kan R transf ormants . Colonies 
were screened for the presence of plasmids carrying all three 
cassettes after digestion with BamHI + Mlul + Spel and designated 
pMV200. An additional restriction site, Ncol, was eliminated 
from the M.rep cassette by digestion of pMV200 with Ncol, fill in- 
with Klenow, and ligation and recircularization, resulting in the 
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formation of pMV201. A schematic of the formation of pMV200 from 
pMV123 and pUC19, and of pMV201 from pMV200, is shown in Figure 
8. Plasmids pMV200 and pMV201 were transformed into M, smegma tis 
and BCG. Both plasmids yielded Kan R transformants, thus 
indicating their ability to replicate in mycobacteria. 

A synthetic multiple cloning sequence (MCS) (Figure 9) was 
then designed and synthesized to facilitate versatile molecular 
cloning and manipulations for foreign gene expressions in 
mycobacteria, and for integration into the mycobacterial 
chromosome. The synthetic MCS, shown in Figure 9, contains 16 
restriction sites unique to pMV201 and includes a region carrying 
translation stop codons in each of three reading frames, and a Tl 
transcription terminator derived from E. coli rrnAB ribosoraal RNA 
ope r on. 

To insert the MCS cassette, pMV201 was digested with Narl 
and Nhel, and the resulting fragment was gel purified. The MCS 
was digested with HinPI and Nhel and, the resulting fragment was 
gel purified. The two fragments were then ligated to yield 
pMV204. A schematic of the construction of pMV204 is shown in 
Figure 10. 

Plasmid pMV204 was then further manipulated to facilitate 
removal of the M.rep cassette in further constructions. plW204 
was digested with Mlul, and an Mlul - Not I linker was inserted 
into the Mlul site between the M.rep and the E.rep to generate 
pMV206. A schematic of the construction of pMV206 from pMV204 is 
shown in Figure 11, and the DNA sequence of pMV206 is given in 
Figure 12. 

7. Insertion of BCG HSP60 promoter sequence . 

The published sequence of the BCG HSP60 gene (Thole, et al., 
Infe c t, and Immun. , Vol. 55, pgs. 1466-1475 (June 1987)), and 
surrounding sequence permitted the construction of an HSP60 
promoter fragment by PCR. The 251 bp HSP60 promoter fragment 
(Figure 13, and as published by Stover, et al. (1991)) was 
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amplified by PCR with primers including added Xbal and Nhel 
sites The PCR HSP60 fragment is then digested with Xbal and 
Nhel,* and ds ligated into Xbal digested P MV206 to form P RB26 

(FlgU r ^Ingestion of ™t encodinfl the TP " tuberculosis 

^ T ^o and OSPA .. em tio into m ycobacterial expression 

^^^e sequence of the 19 kda M tuberculosis gene is given in 
Ashbridge, et al. , Nucleic Acids Research, Vol. 17 P g. 1249 
U989>. The 19 kda antigen gene ribosomal binding site, start 
codon, and signal sequence from M tuberculosis chromosomal DNA 
were amplified by PCR with nucleotide primers The result ng 15 
bp fragment (Figure 15) obtained by PCR includes added Bglll (5 ) 
and BamHI: EcoRI sites (3'). This fragment contains the entxre 
5- region of the 19 kda gene up to the 27th codon with the 
exception of the promoter sequence. The PCR fragment is digested 
with Bglll and EcoRI and ligated into BamHI -EcoRI digested pRB26 
to form p2619S (Figure 16). 

The gene encoding the OspA antigen is described in 
Bergstrom, et al., Molecular Microbiology, Vol. 3, No. 4, pgs. 
479-486 (1989). The OspA gene sequence, excluding only the 
N-terminal 18 codons (encoding the secretion signal) was derived 
by PCR with added BamHI (5«) and Sail (3') slices to provide a 780 
bp OspA fragment. P 2619S was digested with BamHI and Sail, and 
the 780bp PCR OspA fragment was digested with BamHI and Sail to 
generate cohesive ends and ligated to BamHI and Sail digested 
P 2619S to form p2619::Os P A. (Figure 17). 



Example 2 

rvmsfrT-uctlon of mycobacterial vector 
-including promoter *«d DNA encoding signal 
S pmience of 19 kda Vf - tuberculosis antigen 
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Plasmid pMV206 was constructed as described in Example 1. 
The 19 kda M. tuberculosis antigen gene promoter/ ribosoraal 
binding site, start codon, and secretion signal was amplified by 
PCR with nucleotide primers. The PCR fragment includes added 
Xbal and BamHI sites. This sequence, shown in Figure 18, which Is 
286 bp in length, includes the entire published 5 f region of the 
19 kda gene up to the 27th codon. The PCR fragment was then 
digested with Xbal and BamHI, and ligated into Xbal and BamHI 
digested pMV206 to form pl9PS (Figure 19). The 780 bp OspA PCR 
cassette, as described in Example 1, was digested with BamHI and 
Sail, and ligated to BamHI and Sail digested pl9PS to form 
pl9PS: :0spA. 

Example 3 

Construction of mycobacterial expression 
vector with M. tuberculosis 38 kda antigen 
promoter and signal sequence and OspA gene 
The gene sequence for the M. tuberculosis 38 kda antigen is 
given in Andersen, et al., Infection and Immunity , Vol. 57, No. 
8, pgs. 2481-2488 (Aug. 1989)- A DNA sequence encoding the 38 
kda antigen promoter, ribosomal binding site, start codon, and 
secretion signal, obtained from M. tuberculosis chromosomal DNA, 
and containing the entire 5' sequence up to the 45th codon, was 
amplified by PCR with nucleotide primers. The resulting PCR 
fragment includes added Xbal and BamHI sites. The PCR fragment, 
297 bp in length, and shown in Figure 20, was digested with Xbal 
and BamHI, and ligated into Xbal and BamHI digested pMV206 to 
form p38PS (Figure 21). The 780 bp OspA PCR cassette, as 
hereinabove described in Examples 1 and 2, is digested with BamHI 
and Sail and ligated into BamHI and Sail digested p38PS to form 
p38PS::0spA. 



Example 4 
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n :+Ti, e t:ior mveobacteri al expression 

™r*»r» with Agression cassette based on 
BCC HSP60 arif * OspA gene 
pM V206 wis constructed as hereinabove described in Example 

The published sequence of the BCG HSP60 gene (Thole, et al, 
Infec t, .nd Immun. , Vol. 55, pgs. 1466.1475 (June 1987,), and 
bounding sequence permitted the construction of a cassette 
carrying expression control sequences (i.e., promoter, ribosomal 
binding site, and translation initiation sequences as published 
in Stover, et al. (1991), by PGR. The BCG HSP61 cassette (Figure 
22) contains 375 bases 5' to the BCG HSP6C* start codon, and 15 
bases (5 codons) 3' to the start codon. PGR oligonucleotide 
primers were then synthesized. Primer Xba-HSP60, of the 

following sequence: 

CAG ATC TAG ACG GTG ACC ACA ACG CGC C 
was synthesized for the 5« end of the cassette, and primer 
Bam-HSP61, of the following sequence: 

CTA GGG ATC CGC AAT TGT CTT GGC CAT TG 
was synthesized for the 3' end of the cassette. The primers were 
used to amplify the cassette by PGR from BCG strain Pasteur 
chromosomal DNA. The addition of the Bam HI site at the 3 ' end 
of the cassette adds one codon (Asp) to the first six codons of 
the HSP60 gene. 

Each of pMV206 and the PCR cassette HSP61 was digested with 
Xbal and BamHI. The PCR cassette was then inserted between the 
Xbal and BamHI sites of P OT206, then ligated to form plasmid 
PHV261. The construction of this plasmid is shown schematically 
in Figure 23. 

The 780 bp OspA PCR cassette as herein '/e described, was 
digested with BamHI and Sail, and ligated > BamHI and Sail 
digested pMV261 to form p261::0spA. 
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Example 5 

A DNA cassette encoding the promoters and transcription 
start sites, as identified in Stover, et al. (1991), ribosome 
binding site/ and start codon of the BCG HSP60 gene, was 
constructed by PCR. Such a cassette is the same as that of the 
BCG HSP61 cassette hereinabove described except that this 
cassette does not include the 15 bases (5 codons) 3' to the start 
codon. This cassette, which is 267 bp in length, and shown in 
Figure 24, includes added Xbal and Ncol sites, with a start codon 
inclitded in the Ncol site. The cassette, after construction, was 
digested with Xbal and Ncol. 

This cassette was placed into Xbal and Ncol digested pMV206 
to form pMV251. (Figure 25). A full length OspA gene (including 
the signal sequence and as published in Bergstrom, et al. (1989)) 
was then derived by PCR as an Ncol-Sall restriction fragment. 
This fragment was then digested with Ncol and Sail, and ligated 
to Ncol and Sail digested pMV251 to form p251::OspA. 

Example 6 

pRB26 was constructed as described in Example 1. The 38 kda 
antigen gene ribosomal binding site, start codon, and secretion 
signal sequence was obtained from M. tuberculosis chromosomal DNA 
and amplified by PCR with nucleotide primers. The resulting 
fragment also includes added Bglll-BamHI :EcoRI sites. The PCR 
fragment, 210 bp in length (Figure 26), is digested with Bglll 
and EcoRI and ligated into BamRI and EcoRI digested pRB26 to form 
p2638S (Figure 27). p2638S is then digested with BamHI and Sail. 
The 780 bp OspA PCR fragment described in Example 1 is digested 
with BamHI and Sail and ligated to the BamHI and Sail digested 
p2638S to form p2638: :OspA. 



Example 7 
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This example describes the formation of P 3638::OspA, which 
includes sequences encoding bacteriophage integration into a 
mycobacteria chromosome, DNA encoding the secretion signal of 
the 38 kda m ^rculosis antigen, as well as the OspA gene. 

PM7206 was constructed as hereinabove described in Example 

X * Plasmid pMH9.4, which includes the mycobacteriophage L5 attP 
site and the L5 integrase gene, was employed in providing the L5 
integration sequences to a BCG expression vector. The 
construction of pMH9.4, as well as its integration into 
smegmatis and BCG, is described below in sections (i) through 
(vi>. 

(i) Identificati Qi l " f * he DNA *«W nces of the attachment sites, 
atttL attL. and attR. of M. smegmatis. 

Using standard technologies, a lambda EMBL3^ library was 
constructed using chromosomal DNA prepared from mc 61 (a strain 
of M. smegmatis which includes an M. smegmatis chromosome into 
which has been integrated the genome of mycobacterial phage L5) 
and digested with Bam HI. Phage L5 contains DNA having 
restriction sites identical to those of phage LI (Snapper, et al. 
1988), except that L5 is able to replicate at 42°C and phage LI 
is incap-ble of such growth. This library was then probed with a 
6.7 kb DNA fragment isolated from the L5 genome that had been 
previously identified as carrying the attP sequence (Snapper, et 
al 1988). One of the positive clones was plaque purified, DNA 
prepared, and a 1.1 kb Sal I fragment (containing the AttL 
sequence) sub-cloned into sequencing vector pUC119. The DNA 
sequence of this fragment was determined using a shotgun approach 
coupled with Sanger sequencing. By isolating and sequencing the 
attL junction site and comparing this to the DNA sequence of L5 
that was available, a region was determined where the two 
sequences aligned but with a specific discontinuity present. The 
discontinuity represents one side of a core sequence, which is 
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identical in AttP, attB, and attL. The region contiaining the 
recombinational crossover point is shown in Figure 28. 

The attL DNA (1.1 kb Sal I fragment) was used as a probe to 

2 

hybridize to a Southern blot of Bam HI digested mc 6 DNA, which 

is a strain of M. smegma tis which includes an M. smegmatis 

chromosome without any phage integration (Jacobs, et al, 1987, 

hereinabove cited.)* A single band of approximately 6.4 kb was 

detected corresponding to the attB sequence of M. smegmatis . 

2 

This same attL probe was used to screen a cosmid library of mc 6 
(provided by Dr. Bill Jacobs of the Albert Einstein College of 
Medicine of Yeshiva University), and a number of positive cosmid 
clones were identified. DNA was prepared from these clones, and 
a 1.9 kb Sal I fragment (containing the attB site) that 
hybridizes to the attL probe was subcloned into pUC119 for 
sequencing and further analysis. The DNA sequence containing the 
core sequence was determined and is shown in Figure 28. The core 
sequence; which is identical in attP, attB and attL, has a length 
of 43bp. 

2 

. The mc 61 lambda EMBL3 library was then probed with the 
1.9kb Sail fragment containing the attB site. Positive plaques 
were identified, DNA was prepared, and analyzed by restriction 
analysis and Southern blots. Lambda clones were identified that 
contained a 3.2kb Bam HI fragment containing the putative attR 
site. The 3.2kb Bam HI fragment was purified and cloned into 
pUC119 for sequencing and further analysis. 

.( i i.) Determination of attP-integrase region of L5 genome . 

Concurrent with the above procedures, a significant 
portion of the DNA sequence of L5 had been determined and 
represented in several "contigs" or islands of DNA sequence. 
Sequences of the 6.7kb Bam HI fragment hereinabove described were 
determined by (a) analysis of the location of Bam HI sites in the 
contigs of the DNA of L5, and (b) by determining a short stretch 
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of DNA sequence from around the Bam HI sites of plasmxd P JR-1 
(Figure 33), which carries the 6.7kb Bam HI fragment of L5. 

A segment of DNA sequence was located that represented the 
6 7kb Bam HI fragment of phage L5. Studies of other phages have 
shown that the integrase genes are often located . close to the 
attP site. It was thus determined that the L5 integrase (int) 
oene should lie either within the 6.7kb Bam HI fragment or in a 
DNA sequence on either side of it. The DNA sequence in the 
regions was then analysed by translating it into all six possible 
reading frames and searching these amino acid sequences for 
similarity to the family of integrase related proteins, and 
through computer-assisted analysis of the DNA sequence. As shown 
in Figure 29, there are shown two domains of reasonably good 
conservation among L5 integrase and other integrases, and three 
amino acid residues that are absolutely conserved in domain 2. 
(See Yagil, et al., -T- «bl. Biol., Vol. 207, pgs. 695-717 (1989), 
and Poyart-Salmeron, et al. , J. EHBO. . Vol. 8, pgs, 2425-2433 
(1989)). A region was identified, and analysis of the 
corresponding DNA sequence showed a reading frame that could 
encode for a protein of approximately 333 amino acids. These 
observations identified the putative int gene. 

The location of the int gene was not within the 6.7kb Bam HI 
fragment; however, it was very close to it with one of the Bam HI 
sites (that defines the 6.7kb Bam HI fragment) less than 100 bp 
upstream of the start of the gene. Analysis of the Bam HI sites 
showed that the int gene lay within a 1.9kb Bam HI fragment 
located adjacent to the 6.7kb Bam HI fragment. This 1.9kb Bam HI 
fragment was cloned by purification of the fragment from a Bam HI 
digest of L5 DNA and cloning into pUC 119, to generate pMHl 
(Figure 34). 

From a combination of the above approaches, a schematic of 
the organization of the attP-int region of L5 was constructed 
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(Figure 30) , and the gene sequence of the attP-int region is 
given in Figure 31. 

f ili) Construction of pMH5 . 

The 6.7kb Bam HI fragment of myc ©bacteriophage L5, which 
contains the attP site, as hereinabove described, was cloned into 
the Bam HI site of pUC 119 (Figure 32). This was achieved by 
purifying the 6.7kb Bam HI fragment from a Bam HI digest of L5 
DNA separated by agarose gel electrophoresis and ligating with 
Bam HI cut pUC 119. DNA was prepared from candidate recombinants 
and characterized by restriction enzyme analysis and gel 
electrophoresis. A recombinant was identified that contained the 
6.7kb Bam HI fragment of L5 cloned into pUC 119. This plasmid 
was named pJR-1, as shown in Figure 33. 

Analysis of DNA sequence data from a project to sequence L5 
showed that a 1.9kb Bam HI fragment adjacent to the 6.7kb Bam HI 
fragment hereinabove described contained the integrase gene. 

A plasmid containing a 1.9kb Bam HI fragment containing the 
DNA encoding for the integrase cloned into the Bam HI site of pUC 
119 was constructed. The 1.9kb fragment was purified from a Bam 
HI digest of L5 DNA and cloned into the Bam HI site of pUC 119. 
Construction of the recombinant was determined by restriction 
analysis and gel electrophoresis. This plasmid was called pMHl, 
the construction of which is shown schematically in Figure 34. 

pJR-1 was then modified by digestion with BcoRI and SnaBI 
(both are unique cloning sites), between which is a Bam HI site. 
The Eco Rl-Sna BI fragment, including the Bam HI site was 
excised, and the plasmid was religated to form plasmid of pMH2, 
which contains on Bam HI site compared to two Bam HI sites 
contained in pJR-1. A schematic of the construction of pMH2 is 
shown in Figure 35. 

The 1.9kb Bam HI fragment, which includes the integrase 
gene, was purified from a Bam HI digest of pMHl and ligated to 



„ rn M PCT/US92/09075 
WO 93/07897 

30 

Bam HI digested pMH2. Recombinants were identified as above and 
the orientation of the 1.9kb fragment determined. A plasmid 
called pMH4 was thus constructed (Figure 36) in which the region 
from the Sna BI site (upstream of attP) through to the Bam HI 
site (downstream of the integrase gene) was identical to that in 
L5. 

pMH4 was digested with HindHI (unique site) and was ligated 
to a lkb Hindi 1 1 fragment purified from pKD43 (supplied by Keith 
Darbyshire of the Nigel Gindley Laboratory) that contains the 
gene determining resistance to kanamycin. Recombinants were 
identified and characterized as above. This plasmid is called 
pMH5. A schematic of the construction of pMH5 is shown in Figure 
37. 

( iv) Integration of pMH5 into attB of M. smegmatis. 

Plasmids pYUB12 (a gift from Dr. Bill Jacobs, a schematic of 
the formation of which is shown in Eigure 1), pMDOl (Eigure 38), 
and pMH5 were electroporated, with four different concentrations 
of plasmid DNA over a 1,000-fold range, into M. smegmatis strain 
mc 2 155, a strain which is able to support plasmid replication. 
In sections (iv) through (vi), all electroporation procedures of 
M. smeg matis . or of BCG, were carried out as follows: 

Cultures of organism were grown in Middlebrook 7H9 media, as 
described by Snapper, et al. (1988), harvested by centrifugation, 
washed three times with cold 10% glycerol, and resuspended at 
approximately a 100 x concentration of cells. 

1 ul of DNA was added to 100 ul of cells in an ice-cold 
cuvette and pulsed in a Bio-Rad Gene Pulser, and given a single 
pulse at 1.25 kv at 25 uF. 1 ml of broth was added the cells 
incubated for 1 hr. at 37°C for expression of the 
antibiotic-resistant marker. Cells were then concentrated and 
plated out on Middlebrook or tryptic soy media containing 15 
Kg/ml kanamycin. Colonies were observed after 3 to 5 days 
incubation at 37°C. 
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Each of pYUB12, pMDOl, and pMHS carries kanamycin 
resistance. Plasmid pYUB12 carries an origin of DNA replication, 
while pMDOl lacks a mycobacterial origin of replication. Plasmid 
pMH5 does not* carry a mycobacterial origin of replication, but 
carries a 2kb region of phage L5 which contains the attP site and 
the integrase gene (Figure 31). The number of transf ormants were 

linear with DNA concentration. Plasmid pYUB12 gives a large 

5 2 
number of transf ormants (2 x 10 per \ig DNA) in mc 155, while 

4 

pMHS gives 6 x 10 transf ormants per jig DNA, and pMDOl gives no 
transf ormants. 

The above experiment was then repeated by electroporating 

the plasmids pYUB12, pMDOl, and pMH5 into M. smegmatis strain 

2 

mc 6, which does not support plasmid replication. No 

2 

transf ormants in mc 6 were obtained from pYUB12 or pMDOl, while 

4 

pMHS gave approximately 10 kanamycin resistant transf ormants in 

2 

mc 6 per \ig of DNA, thus indicating integration of pMHS into the 
mc^6 chromosome. 

2 

DNA from six independent pMH5 transf ormants (four in mc 155 
2 

and two in mc 6) was prepared. These DNA 1 s (along with DNA from 

2 2 
both mc 155 itself, and mc 155 carrying the plasmid pYUB12) were 

digested with a restriction enzyme, and analyzed by Southern blot 

and hybridization with the M. smegmatis 1.9kb attB probe 

hereinabove described. As shown in Figure 39, all six 

transf ormants have integrated into the attB site, resulting in 

the production of two new DNA fragments with different 

mobilities. If pMHS did not integrate into the attB site/ it 

would be expected that a single band, corresponding to the attB 
2 

site in the mc 155 control, would be obtained, 
(v ) Construction of PMH9.2 and pMH9.4 

pUC119 was digested with Hindlll, and a Ikb Hindi I I 
fragment, containing a kanamycin resistance gene, purified from 
pKD43, was ligated to the Hindi 1 1 digested pUC119 to form pMH8 
(Figure 40). A 2kb Sail fragment (bp 3226-5310), which carries 
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the attP and integrase gene from Sail digested pMH5, was purified 
and inserted in both orientations relative to the vector backbone 
of Sail digested P MH8 to form plasmids pMH9.2 and pHBS.4 (Figures 
41 and 42). 

M- gnegmatis strain mc 2 155 cells carrying, as a result of 
electroporation, plasmid pYUB12, pMH9,2 or pMH9.4, or strain mc 6 
cells carrying plasmid pMH5, as a result of electroporation as 
hereinabove described, were grown to saturation in broth with 
kanamycin. Cultures were then diluted 1:100 into broth without 
kanamycin and grown to saturation. Two further cycles of 
dilution and growth were done, corresponding to about 20 
generations of bacterial growth. Cultures were plated out to 
single colonies on non-selective plates, and approximately 100 of 
these colonies were patch plated onto both non-selective and 
selective plates. The % of colonies that were sensitive to 
kanamycin, thus corresponding to the percentage of cells which 
lost the plasmid, is given below in Table I. 

Table I 

% loss 

pYUB12 (mc 2 155) 35 

pMH5 (me 2 6) 17 

pMH9.2 (mc 2 155) 3 

pMH9.4 (mc 2 155) 0 
(yi) T ransformation of BC G with PMH9.4 

The 1.9 kb Sal I fragment, which includes the M. smegmatis 
attB site as hereinabove described was cloned into pUC119, and 
the plasmid generated was named pMH-12. (Figure 43). 

Gel purified Sal I 1.9kb M. smeomatis fragment containing 
attB (isolated from pMH-12) was used to probe a Southern transfer 
of Bam HI digested mycobacterial DNA's, including BCG substrain 
Pasteur, shown in Figure 44. This demonstrated that there is one 
Bam HI fragment of BCG that strongly hybridizes to the 
smegmatis attB probe and three hybridize weakly. The strongest 
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hybridizing band is the fastest moving band (approximately 1.9 
kb). 

The same probe as above was used to probe a BCG cosmid 
library (provided by Dr. Bill Jacobs) and positive clones were 
identified. DNA was prepared from several positive clones and 
analyzed by restriction analysis and Southern blotting. The 1.9 
kb Bam HI fragment (corresponding to the strongly hybridizing 
band in the Southern blot was identified, gel purified from the 
cosmid DNA and cloned into pUC119. The resulting plasraid was 
named pMH-15. (Figure 45). 

Plasmid pMH-5 and pMH9.4 were electroporated into BCG 

Pasteur. It was observed that pMH9.4 transforms BCG with high 

4 

efficiency (approximately 10 transf ormants/^g DNA), while pMH-5 

transforms BCG at low efficiency (1-10 transformants/ng DNA). 

DNA was prepared from BCG transf ormants and analyzed by Bam HI 

restriction and Southern blot analysis, probing with gel purified 

1.9kb Bam HI BCG attB fragment from pMH-15. These data are shown 

in Figure 41 and show that integration of both pMHS and pMH9.4 is 

specific to the BCG attB site (ie. the strongly cross-hybridizing 

fragment in BCG). This is illustrated by the loss of the 1.9kb 

Bam HI fragment from the transformants and the appearance of two 

new bands representing attL and attR junction fragments. Figure 

46 shows just one of the pMH5/BCG transformants, although all of 

the four that were analyzed show that one of the bands (the 

largest) is smaller than expected (and different in each of the 

transf ormants) , indicating that the transformation effiency of. 

pMH-5 is low in BCG. In contrast, the four pMH9.4 transformants 

are identical to each other (Figure 46) and give attR and attL 

junction fragments of the predicted sizes. 

Plasmid pMV206 was digested with Not I to remove the 

mycobacterial replicon. The resulting 2209 bp fragment, which 

p 

includes the aph (Kan ) gene, the E. coli replicon and the 
multiple cloning site, was ligated and recircularized to form 
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pMV205, the construction of which is schematically depicted in 
Figure 11. 

PCR with primers Xbal-Att/Int and Nhel-Att/Int was then 
performed on a Sal I fragment from pMH9.4, which contains the 
attP site and the L5 integrase gene. The resulting cassette was 
then digested with Xbal and Nhel and a 1789 bp fragment was gel 
purified. pMV205 was then digested with Nhel, and the resulting 
fragment was ligated to the 1989 bp fragment obtained from pMH9.4 
to form pMV306. A schematic of the construction of pMV306 is 

shown in Figure 47. 

P 2638::OspA (from Example 6) and pMV306 were each digested 
with Xbal and Sail. The Xbal-Sall fragment of p2638:OspA, which 
contains the HSP60 promoter, 38 kda secretion signal sequence, 
and OspA antigen sequence, was ligated into Xbal and Sail 
digested pMV306 to form p3638:tOspA. 

Example 8 

P RB26 was constructed as described in Example 1. The 32 kda 
dL -antigen gene of M. tuberculosis or BCG (Matsuo, et al., «L. 
Bacteriol, Vol. 170, No. 9, pgs 3847-3854 (Sept. 1988); 
Borremans, et al.. Infect, an d Immun., Vol. 57, No. 10, pgs. 
3123-3130 (Oct. 1989)) was obtained from BCG chromosomal DNA and 
amplified by PCR using primers including added Bglll-BamHI :EcoRI 
sites. The PCR fragment, 420 bp in length (Figure 48), was 
digested with Bglll and EcoRI, and ligated into. BamHI and EcoRI 
digested pRB26 to form pAB261 (Figure 49), which contains the 
entire^ -antigen gene. pAB261 was then digested with BamHI and 
Sail, and the 780bp PCR OspA cassette hereinabove described in 
Example 1, was also digested with BamHI and Sail, and was ligated 
to BamHI and Sail digested pAB261 to form pAB261: :0spA. 



Example 9 
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Plasmid pMV206 was constructed as hereinabove described in 
Example 1. 

A partial sequence of the 5 1 region of the CCG HSP70 gene 
(which encodes the BCG HSP70 heat shock protein/ also known as 
the 70 kda antigen) obtained by Dr. Raju Lathigra (Medical 
Research Council, London) permitted the construction of a 
cassette carrying the promoter sequence. The HSP70 promoter was 
amplified by PCR with primers including Xba and Nhel sites. The 
HSP70 promoter PCR fragment, 121 bp in length (Figure 50), was 
digested with Xbal and Nhel, and ligated to Xbal digested pMV206 
to form pRB27. (Figure 51.) The 32 kda -antigen gene of BCG 
was obtained from BCG chromosomal DNA as described in Example 8, 
and amplified by PCR using primers including added 
Bglll-BamHI: EcoRI sites. The PCR fragment was digested with 
Bglll and EcoRI, and ligated into BamHI and EcoRI digested pRB27 
to form pAB271 (Figure 52), which contains the entire #: -antigen 
gene. pAB271 was then digested with BamHI and Sail, and the 
780bp PCR OspA cassette hereinabove described in Example 1, was 
also digested with BamHI and Sail, and was ligated to BamHI and 
Sail digested pAB271 to form pAB271 : :OspA. 



Example 10 

Vectors pl9PS: :OspA, m p38PS: :OspA, pMV261: :OspA, and 
pMV251::OspA were transformed into BCG. The transformed BCG 
cells were cultured, and the cells were then sedimented from the 
cultures. The cells were then suspended in phosphate buffered 
saline (PBS) , and cell suspensions were normalized to equivalent 
densities. The cells were disrupted by sonication, the cell 
envelopes were sedimented, and the supernatant (a 
Cytosol-enriched fraction) was saved. The cell envelopes were 
resuspended in PBS, and membranes were solubilized at 4°C by the 
addition of Triton X-114 to 2% (vol./vol.). Insoluble material 
(a cell wall-enriched fraction) was sedimented, and the 
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supernatant (nMbrM-enrlched fraction) was removed. Triton 
X-H4 was added to the Cytosol-enricbed fraction. After brief 
warming of the Triton X-114 solutions at 37«C. separation of 
aqueous and detergent phases „,s achieved by a short 
centrifugation. These two phases were bach-extracted thro" 

bT e L m add?T lM T reP " Seatati « pracipitated 
by the addition of acetone. A portion of each culture 
supernatant was concentrated by an ultrafiltration device 
(Centrzc.n-30. Amicon). Samples representing culture volume 
equxvalents were processed by SDS-PAOE, transferred to 
nitrocellulose, and Western blotted with anti-OspA monoclonal 
antibody (Mab) HS332. (Howe et al T n e~* / clonal 
„ , 11 ec Jnfect. and Inmnn V ol. 

54. Ho. 1. pgs . 207-212 (Oct. 1986),. Filter-bound antibody was 
visualised with an enhanced chemi luminescence, system (Amersnam, 

(LinZ „ ln / i9Ure ^ 1 15 3 " 0leeUl " standi 

(Rainbow Barkers, Amersham); lane 2 is a whole cell sonicate 

fraction; lane 3 is Triton X-114 insoluble material; lane 4 is 

-et! iane , is the' d^gen^ ~ ^ ^ 
lane 8 is a concentrated culture medium. 

fusions? " £r ° m ** ' reco " bl — t chimaric OspA 

and eXPreMed fro » «*• expression vectors pl«s.. J* 

Triton X-114 phase from the membrane fractim,* *u 

that thes. recombinant OspA pro Jnstre' tLTT 

=ri„i^rs^r^°- - — — 

additional OspA was also ^ ZtT 

fractions, thus suggesting that the OspA sCfTas ~ 
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efficiently processed in BCG as were the 19 kda and 38 kda signal 
sequences. Recombinant OspA expressed by pMV261 : :OspA, wherein 
OspA was not fused to a lipoprotein signal, was found to be 
localized only in aqueous cytoplasmic fractions. 

Example 11 

BCG cells were transformed with either pAB261: :OspA, 
pAB271: :OspA, or pMV261::OspA and cultured. Portions of BCG 
culture supernatants were depleted of bovine serum albumin (BSA) , 
a component of the medium, by adsorption with Affi-gel Blue (Bio 
Rad). BCG cell pellets from the cultures were suspensed in PBS 
and sonicated. Adsorbed or unadsorbed supernatants were 
concentrated (Centricon 30) and then diluted to the same relative 
concentration, on a culture volume basis, as the lysed cells. 
Samples were used for SDS-PAGE and subsequent immunoblotting with 
anti-OspA (Mab H 5332), anti-Hsp70 (Mab IT-41, WHO mycobacterial 
monoclonal antibody bank), or anti-Hsp60 (Mab IT- 13, WHO 
mycobacterial monoclonal antibody bank). As shown in Figure 54, 
lane M.W. Std. is a molecular weight standard, lanes W are whole 
cell lysates, lanes S are culture supernatants (unadsorbed), and 
lan** A are adsorbed supernatants. As shown in Figure 54, it was 
determined that fusion of the OspA gene, without the secretion 
signal, to the complete o-antigen gene resulted in high level 
expression, and a substantial fraction of the resulting 
recombinant a-antigen-OspA fusion protein was found to be 
excreted into the culture media. The absence of detectable 
quantities of cytoplasmic proteins (Hsp60 and Hsp70) in the 
supernatant indicated that cell lysis was minimal, and that the 
recombinant e-antigen: :0spA fusion protein was specifically 
targeted to be secreted and is not simply found in culture 
supernatants due to autolysis. 
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Example 12 

BCG organisms (Pasteur strain) were transformed with one of 
the following vectors: 
pMV261;i0spA 
pMV251::OspA 
p2619::OspA 
pl9PSr :OspA 
p38PS: :0spA 
p3638: :OspA 
pAB261:rOspA 

As a negative control, pM7261/LZ was used to transform a 
control group of organisms. pMV261/LZ was constructed by cloning 
a BamHI restriction fragment carrying the E^coli lacZ gene (which 
encodes fl-galactosidase> into the BamHI site of Bam HI digested 
pMV261. 

The transformed BCG colonies were isolated by selection for 
kamamycin resistance and expanded in liquid media culture for 
further analysis- Recombinant BCG samples representing culture 
volume equivalents were processed by SDS-PAGE, transferred to 
nitrocellulose, and Western blotted with anti-OspA Mab H5332. 
Positive controls employed were a processed sample of 
B^burgdorferi strain B31, and samples of OspA antigens in 
concentrations of 500 ug/ml, 100 ug/ml. and 20 ug/ml. The 
filter-bound antibody was visualized with an enhanced 
chemilumineseuce system (Amersham). As shown in Figure 55, OspA 
was expressed by BCG transformed with vectors including the OspA 
gene. Figure 55 also shows the expression of a fusion protein of 
OspA and a mycobacterial secretion signal by BCG transofmred with 
p2619::0spA, pl9PS: :OspA, p38PS::OspA, or p3638: :0spA. 

Example 13 

BCG organisms were transformed with pHV261: :0spA, and the 
transformed organisms were cultured. Twenty- four different 
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strains of mice, with five mice representing each strain, were 
immunized with a single dose of 1 x 10 6 CFU of BCG transformed 
with pMV261::OspA (post freeze titer of 42%) intraperitoneal^ . 
The mice were bled every four weeks for 16 weeks, and also at 19 
weeks. Sera were analyzed by ELISA on whole cells of Borrelia 
burgdo rferi and BCG lysate coated on wells. The reaction was 
developed with peroxidase conjugated anti-mouse immunoglobulin 
and substrate. Color development was read as absorbance at 
405nm. Positive sera had optical density (O.D. ) values at three 
standard deviations above the mean of the prebleed sera. At 17 
weeks, the mice were given a booster intraperitoneal injection of 
1 x 10 6 CFU of BCG transformed with pMV261 : :OspA. 

As shown in Figures 56 and 57, the following strains: 

A/He J 

A/J 

AKR/J 

BALB/cByJ 

CBA/J 

C3H/HeJ 

SJL/J 

LP/J 

129/J 

CE/J 

B10.BR/SgSnJ . 

D4 Swiss Webster 

SenCar 

FVB 

showed an immune response after a single immunization, and the 
following strains: 

A/He J 

A/J 

C3H/HeJ 
129/J 
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CE/J 

B10.BR/SgSnJ 

D4 Swiss Webster 

had responded" with significantly high levels, of antibody against 
a>vt-T- «»i -i a burgdorferi . 

Example 14 

BCG organisms transformed with either pMV261 : :OspA, 
pAB261: :OspA, P I9PS::OspA, or pMV251: :OspA, plus non- recombinant 
BCG Pasteur organisms were subjected to cell fractionation and 
Triton X-114 detergent phase partitioning analysis (Bordier, et 
al; - T Mai. Chem. . Vol. 256, pg. 1604 (1981); Radolf, et al., 
infect and Immun.. Vol. 56, pg. 490 (1988)) to determine if 
expression of OspA genes in the vectors hereinabove described 
resulted in export and lipid acylation of recombinant OspA 
protein. 

Recombinant BCG cells wee sedimented from BCG cultures, 
suspended in phosphate buffered saline (PBS) , and cell 
suspensions were adjusted to equivalent densities. Cells were 
disrupted by sonication and mebranes were solubilized at 4°C by 
the addition of Triton X-114 to 2% (vol./vol.). Insoluble 
material (cell wall enriched fraction) was centrifuged, and the 
supernatant was subjected to detergent phase partitioning. After 
briefly warming (37°C) the Triton X-114 solutions, separation of 
aqueous and detergent phases was achieved by a short 
centrifuguation . The two phases were back- extracted three 
times, and proteins in representative samples were precipitated 
by the addition of acetone. A portion of each culture 
supernatant was concentrated by ultrafiltration ■. Samples 
representing 5- fold concentrated culture volume equivalents were 
processed by SDS-PAGE, transferred to nitrocellulose and blotted 
with anti-OspA MAb H5332. (Figure 58). Similar fractions from 
non-recombinant BCG were blotted with appropriate monoclonal 



WO 93/07897 



41 



PCI7US92/09075 



antibodies specific for the BCG or M. tuberculosi s Hsp60 protein 
(IT13), o antigen (HYT27) , or M. tuberculosis 19kda antigen (HYT6) 
to determine the cellular location of the native fusion partners. 
As shown in Figure 58, lane W is a whole cell sonicate fraction; 
lane I is a Triton X-114 insoluble cell wall enriched fraction; 
lane A is a cyto sol -enriched aqueous fraction; lane D is a 
detergent phase (membrane-enriched) fraction and lane M is a 
5-fold concentrated culture medium fraction. 

As shown in Figure 58, the OspA gene product encoded by 
pMV261::OspA was found excessively in the aqueous cytosolic 
fraction (lane A) and correlated with the exclusive cytoplasmic 
location of HSP60. The a-antigen-OspA gene product expressed by 
pAB261::OspA and the native BCG a-antigen were found in the 
insoluble cell wall enriched fraction (lane I), aqueous cytosolic 
fraction (lane A), and media fraction (lane M), but not in the 
detergent soluble lipoprotein-enriched fraction (lane D). The 
presence of the a-antigen in the recombinant BCG culture media 
was not due to recombinant BCG autolysis, as HSP60 was not found 
in the culture media. Compared to the native BCG a-antigen, a 
substantially smaller fraction of the fusion protein expressed by 
pAB261::OspA was secreted into the media, while a larger portion 
was found in the cell wall enriched insoluble fraction. This 
suggests that fusion to the a-antigen could also direct foreign 
antigens to the cell wall. Substitution of the M. tuberculosis 
19kda antigen signal peptide for the OspA signal peptide resulted 
in expression of a chimeric OspA protein that was located almost 
exclusively in the detergent soluble fraction. This finding 
indicated that fusion of the M. tuberculosis 19kda antigen signal 
peptide to OspA did direct efficient expression and export of the 
OspA protein to the membrane of BCG, This result was in contrast 
to the product expressed by organisms transformed with 
pMV251: :OspA, where most of OspA was found in the aqueous 
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fra-«-ion, which may have been due to inefficient processing of 
the native Borrelia signal peptide. 

Example 15 

The recombinant BCG organisms of Example 14 were analyzed by 
flow cytometry to determine if the recombinant OspA gene products 
were accessible on the surface of recombinant BCG to anti-OspA 
antibody. 

Approximately 2 x 10 8 recombinant BCG organisms grown in 
Dubos media supplemented with albumin-dextrose complex and 0.05% 
Tween 80 were harvested by centrifugation. The pelleted 
recombinant BCG organisms were washed with 10 ml. of phosphate 
buffered saline ( P H 7.4) containing 0.05% Tween 80 (PBS-T80) , 
resuspended in 5 ml. PBS-T80, and fixed for 10 minutes in 2% 
paraformaldehyde. Fixed recombinant BCG organisms were pelleted 
and washed twice with 5 ml. PBS-T80, and then resuspended in 1 
ml. of PBS-T80. Polyclonal rabbit sera specific for OspA 
(BCG- adsorbed) was added to the fixed recombinant BCG cell 
suspension to a final dilution of 1:200 and incubated for 30 
minutes at room temperature and 30 minutes on ice. The 
suspension was then pellted by centrifugation, washed twice with 
0.5 ml. PBS-T80 and resuspended in 1 ml. PBS-T80. Goat 
anti-rabbit FITC- conjugated secondary antibody was added to a 
final dilution of 1:50 and incubated for 30 minutes on ice. The 
recombinant BCG-secondary antibody suspension was pelleted by 
centrifugation, washed twice with 1 ml. PBS-T80 and resuspended 
in 2 ml. PBS-T80. Labeled recombinant BCG were mildly sonicated 
to disperse clumped cells and dilutions were analyzed by flow 
cytometry on an FACS scan (Becton-Dickinson) . Recombinant BCG 
containing the designated plasmids and expressing the designated 
chimeric OspA gene products are compared to non- recombinant BCG. 
(Figure 59). 
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As shown in Figure 59, recombinant BCG organisms expressing 
OspA from plasmids pl9PS:0spA, pMV251 : :OspA, and .pAB261; :OspA, 
all demonstrated increased surface fluorescence with anti-OspA 
sera when compared with non- recombinant BCG or recombinant BCG 
expressing OspA from plasmid pMV261: :OspA. The relative surface 
fluorescence exhibited by expression of OspA from organisms 
transformed with pMV251::0spA was less than that observed for 
organisms transformed with pl9PS:;OspA, and was in agreement with 
the fractionation analysis of Example 14. The recombinant BCG 
expressing OspA from pAB261::OspA also exhibited surface 
f lourescence, thus confirming that the a -anti gen- OspA fusion 
protein found in the Triton insoluble fraction (Example 14) was 
cell wall associated and not derived from insoluble inclusion 
bodies. Therefore, it was possible to export OspA to the surface 
of BCG as a membrane- associated lipoprotein by fusion to the M. 
tuberc ulosis 19kda antigen signal sequence , or as a secreted and 
cell wall associated protein by fusion to the a-antigen. 

Example 16 

C3H/He, BALB/C, and Swiss Webster mice were immunized with 
10 6 colony forming units of BCG organisms transformed with 
pMV261: :OspA, pMV251: :OspA, pl9PS: :OspA, pAB261 : : OspA, or of 
non- recombinant BCG Pasteur, The mice were given a booster of 
the identical dose at 16 weeks. As shown in Figure 60, all three 
mouse strains immunized with BCG transformed with pMV251::OspA or 
pl9PS::0spA exhibited strong OspA- specific antibody responses 
within 4 to 8 weeks after a single immunization as measured by 
ELISA to whole Borrelia organisms or purified OspA. Particularly 
striking were the anti-OspA responses elicited by a single dose 
of BCG organisms transformed with either pMV251::OspA or 
pl9PS::0spA; in the low responder Swiss Webster strain; the same 
strain of mice immunized with BCG transformed with pMV261::OspA 
or pAB261::OspA did not mount anti-OspA responses even after 
boosting. Peak anti-OspA antibody titers exceeding 1:10 5 in 
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BALB/C and C3H/He mice, and. 1:10 4 in Swiss Webster mice were 
elicited by boosting with BCG transformed with P MV251::OspA or 
pl9PS:rOspA, and these responses were 100 to 1,000- fold higher 
than the responses induced with BCG transformed with P MV261::OspA 
or pAB261: :OspA. 

Example 17 

Immune sera from the immunized C3H/He and BALB/C mice of 
Example 16 were analyzed for their ability to inhibit growth of 
the non-pathogenic B31 laboratory strain of B. burdorferi in 
culture in two independent experiments. (Sadziene, et al., jL_ 
Infe ct, Diseases , in press (1992). Growth inhibition titers for 
each of the immune sera are given in Table I below: 

Table I 



House 


Vector 




Titer 


Strain 




Experiment 1 


Experiment : 


BALB/C 


pMV261r :OspA 


<8 


N/A 


BALB/C 


pMV251: :0spA 


4096 


8924 


BALB/C 


pl9PS: :OspA 


1024 


16384 


BALB/C 


pAB261: tOspA 


N/A 


N/A 


BALB/C 


none (Control) 


<8 


<8 


C3H/He 


pMV261: ;0spA 


32 


N/A 


C3H/He 


pMV251:iOspA 


1024 


32768 


C3H/He 


pl9PS: :OspA 


2048 


16384 


C3H/He 


pAB261: :OspA 


256 


N/A 


C3H/He 


none (Control ) 


<8 


<8 



The above results show that antisera obtained from mice 
immunized with BCG transformed with pMV251:r0spA or pl9PS: :OspA 
exhibited strong growth inhibition titers while sera derived from 
mice immunized with BCG transformed with P MV261::OspA showed 
lower or undetectable growth inhibition titers. 

C3H/He and BALB/C mice immunized with the BCG organisms 
hereinabove described were then challenged 



with either 



1Q 6 B . burgdorferi strain Sh 2 organisms intraperitoneal^ (IP) or 
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10 organisms intradermal ly (ID). The B . burgdorferi organisms 
were administered 5 weeks after a booster immunization of 10^ 
transformed BCG organisms. The mice were sacrificed 14 days 
after the B . burgdorferi challenge, and plasma, and bladder tissue 
were cultured in BSKII media. (Schwan, et al., J. Clin. 
Microbiol, Vol. 20, pg. 155 (1984)). Cultures were monitored 
through day 14 by phase contrast microscopy for the presence of 
spirochetes. The presence of one or more spirochetes per 20 high 
power fields in any one of the plasma or tissue cultures was 
scored as an infection. The fraction of the challenged mice 
exhibiting positive infections in the IP, and ID challenges are 



given in 


Table II below. 


Table II 




Mouse 
Strain 


Vector 


No. of Infections 






IP 


ID 


BALB/C 


pMV261: :0spA 


5/5 


N/A 


BALB/C 


pMV251: :OspA 


0/5 


0/5 


BALB/C 


pl9PS: :OspA 


0/5 


0/5 


BALB/C 


pAB261: :OspA 


4/5 


N/A 


BALB/C 


none (Control) 


4/4 


4/4 


C3H/He 


pMV261: -.OspA 


3/4 


N/A 


C3H/He 


pMV251: :OspA 


0/5 


0/5 


C3H/He 


pl9PS: :OspA 


3/5 


0/5 


C3H/He 


pAB261: :OspA 


3/5 


N/A 


C3H/He 


none (Control) 


5/5 


5/5 


The 


above results show 


that all control mice 


were 



be infected, whereas the mice that were immunized with BCG 
transformed with pMV251::OspA or pl9PS::0spA were protected from 
infection. 

It is to be understood however, that the scope of the 
present invention is not to be limited to the specific 
embodiments described above. The invention may be practiced 
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other thai, as particularly described and still be within the 
scope of the accompanying claims. 
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WHAT IS CLAIMED IS: 

1. An expression vector for expressing a protein or 
polypeptide- or peptide in a bacterium, comprising: 

a first DNA sequence encoding at least a secretion signal of 
a lipoprotein and a second DNA sequence encoding a protein or 
fragment thereof, or polypeptide or peptide heterologous to the 
bacterium which expresses the protein or fragment thereof or 
polypeptide or peptide, whereby said bacterium expresses a fusion 
protean of a lipoprotein or lipoprotein segment and said protein 
or fragment thereof, or polypeptide or peptide heterologous to 
the bacterium which expresses the protein or polypeptide or 
- peptide. 

2. The expression vector of Claim 1 wherein the bacterium 
is a mycobacterium. 

3. The expression vector of Claim 1 wherein said first DNA 
Z^~ S ^ l6aSt 8 SeCreti ° n Si - al * * —bacterial 

«• The expression vector of Clsim 3 wherein said 
mycobacterial apoprotein is an M^tub^cul^ lipoprotein 

S The expression vector of Claim 4 wherein said M 

Se iTsd- 212 " P0Pr0t ' ln 13 -~ *"„ P consisting t 
the 19 kda and 38 kda antigens. 

forth!' The , eXPre " i0 " V, ^° r ° f * "herein said vector 

further comprises a mycobacterial origin of replication 

furth^ Th<! ' XPr " Bim Vector ° £ cl »l" 2 "herein said vector 

inte^aVT^" * ° eqUenCe enC ° din9 "y~b.ct.no,*.,. 

integration into a mycobacterium chromosome. 

thereof T^" " 1 ^"'^ "** Pr0teln « 

hereof, or polypeptide or peptide heterologous to the bacterium 

which expresses the protein or fragment thereof, or polypeptide 

f 0 r r agmtt de d T "* ^ ^^^LZ ' . 

fragment or derivative thereof. 

9. A mycobacterium transformed with the vector of Claim 2. 
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10. The transformed mycobacterium . of Claim 9 wherein the 

mycobacterium is BCG. 

11. A pharmaceutical composition comprising: 
the mycobacterium of Claim 9; and 

an acceptable pharmaceutical carrier. 

12. The expression vector of Claim 1 wherein said vector is 
a plasmid. 

13. The vector of Claim 12 wherein the vector is a shuttle 
plasmid, and further comprises a bacterial origin of replication. 

14. A method of protecting an animal against Lyme disease, 
comprising: 

administering to an animal mycobacteria transformed with DNA 
which includes at least one DNA sequence which encodes a protein 
or polypeptide which elicits antibodies against Borrelia 
burgdorferi, said mycobacteria being administered in an amount 
effective to protect an animal against Lyme disease. 

15. The method of Claim 14 wherein said at least one DNA 
sequence encodes a surface protein of Borrelia burgdorferi or a 
fragment or derivative thereof. 

16. The method of Claim 15 wherein said surface protein of 
Borreli a burgdorferi is selected from the group consisting of 
Outer Surface Protein A and Outer Surface Protein B. 

17. The method of Claim 14 wherein said mycobacteria are of 
the species M. bovis -BCC. 

18. A composition for protecting an animal against Lyme 

disease, comprising: 

mycobacteria transformed with DNA which includes at least 
one DNA sequence which encodes a protein or polypeptide which 
elicits antibodies against Borrelia burgdorferi; and. 

an acceptable pharmaceutical carrier, said mycobacteria 
being present in an amount effective to protect an animal against 
Lyme disease. 
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19- The composition of Claim 18 wherein said at least one 
DNA sequence encodes a surface protein of Borrelia burgdorferi or 
a fragment or derivative thereof. 

20. The* composition of Claim 19 wherein said surface 
protein of Bogrelia burgdorferi is selected from the group 
consisting of Grater Surface Protein A and Outer Surface Protein 
B. 

21. The aonposition of Claim 18 wherein said mycobacteria 
are of the species M. bovis -BCG. 

22- An expression vector for expressing a protein or 
polypeptide in a bacterium which includes a DNA sequence encoding 
at least a secretion signal of a lipoprotein. 
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S/C3 

Fl O. Sex. 

e 

I • 

GCTAGCItTATATGCTrTGATGCAAATTTOATGC^^ 

1 ♦ + ♦ + " — 

CGATCXXSATATAOGCAACTVUXTra^ 

B*m - M rep H,rn,q,E> 
rrzir^irrjLTrr.xcTAracr^ATrAvr^^^cT ^ r\rrrcTrrnrvrr,A'Tr 

101 + + — 

OCroCXnGATAOCTGATCOSCTAGTAOOGCIX X TlUW^ 
TTCTCICCOClCOOOCXXnTCCXJTCGCXl^^ 

201 ♦ ♦ ♦ • + " 

AACAGACGCAGGGGCGCAACCC4COGOCACCTAOCTOWXXXXX1X3GAOCT 

ATTOGAGCCAATCAATTCITGCXXlVGAACTlTn^ 

301 ■ — ♦ ♦ ♦ + 

TAACXJITXGTrACTrAAGAACG<XTCITGACACTTACXXX7 

I,la-,s ' 

GCATCKXXXX^CXJlrTlU/llXJIXC^ -0 

401 ♦ + ♦ ♦ ° 

CCTAGACCDCX7nXCAAaXACGAa3GGTtX3CCACC 0 

GAAlXLVATCACO^TACCCCAGCXLiACGTCl^GCGACTGCI IX'IGCAAAAO 

501 4 + + --■ + + 

CmcmCltXXTATCCGCItXXTIXX^^ x 

TAAACTICraiiAACGOCXlAACTCAGCGOCXnXXAC^ h 

601-- ------- --- ♦ - ...... + ....••... + .. — 

ATITDVGACCITnXXXXnTCACriTXX^^ > 

TAACGAAGOCCTGGCATTCACtXTGAGTGAl'I'IIlCTCTGGT^ X 

701 ♦ — * — — — + " ♦ — — y 

ATI UC 1 1 LlAXiACXXTrAACTCGGACTCACrAAAAAGAGACXZACGCCCXXXrr^ ^ 

TTCATCATCAGTAACCCCTATCCTXMCXIA'IXX^ 2 

801 «• ♦ ♦ ♦ - 

AACTACTACTCATTGGflCATACC^CTtXnAGCAC ACAC XIAAAGTAGCCATA 

OCAAAC^GGAAAAAA(XXAXX.'l'l AACATGGCJLX.lA.'l I'lATCAGAAOCCAGA 
901 ■ + ♦ ♦ ♦ 

amrenocn 1 1 1 ixxxxxxiAATnrrACOGGGCGAAAYACTcnTcccTci 

M&CKTCIGIGAATOQCTICAa^ 

1001 ♦ ♦ ♦ 

TCTXHACACACmaXSAAGTCCTUCT^ 

CACICAACXTTCGAAGCG"! XJl XASl'l lXXXL\COCATCTAGCAAOCACAC3GAA^ 

1101 ♦ • - - - + 

CK^GITTCAACnPOCACAOCAAOGOCTCCCTACATDCl rGGTG TCC TT 
GAACIGACrcCTATCGTItKTrAMOCTACT^ 
1201 ♦ ♦ ZZZZl "*"' 

CTCXl\C7rcAGCATAGCMOCATTTCGATCAAACrc 

ACGAGAGTCGCCACCGATG<XA<XACAAG^ 
1301 ♦ ♦ ♦ + 

Tcxnxnt^axx?rcccTACGcnxnxrnrGTGATGT^^ 

CGAAATGCCTTCGTATCC^CCAAGATTCCT^ 
urn. .. + + - - .4... ........ 4.. 
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FI O. "5b 



♦ ♦ * + + 100 

fiACAGCXTGGCXlAAMXXXlCGGCGGGICAGGXCCA^ 

— ^ N.av.l!] ■ 

aaaCCOGGACCCATOTITXX^ 

AC\TOCOC X X n!GO C TA CCAIOOCCOOCT 200 

OCTCUAICGAACXXXXXXXraCCICC^ 

' ♦ ♦ ♦ + 300 

OClA(nTACCTTCGGCCXXXXnGGAGCGATTXXtt 

OCC4TMCATATOLiTCXXXnaXXX>TCrcCAOC^ 

♦ ♦ ♦ * + 400 

OCIC TTG T ATAGGrACCCCAGCCXXnACACGTOGTCCyjOGTGCXiOOG 

# 

. TCXTrTCLW^aXCGCTAGGCnX^XGGCTIX^ 

0* ♦ ♦ ♦ + 500 

«0 ACACCAACTCC1TXXXXXATCXXA(X^^ 

CTCTCOGACXnX^CCAACAACATGAATCGTCI rOCCTI POOGTGTITOG 

2 ♦ ♦ ♦ + 600 

L_ AGACGCTXK^ CICClTGTIGTA CTrAOCACAAGOCAAAGGCACAAAGC 

x CX^TCGCACGATCCIXXntXXnAOXnCIGG^ 

t OTAGarTOTA^aL^ 700 

^ CCATAOCCCCACTItTITTAOOCTCACAAOClTOCAGTAACO^ 

X + ♦ + + + 800 

<j GGTATGGCOGTCAACAAATGGGAGTGTIGCAAGGTCATTGGOCCG1AC 

ATTACCCOCATGAACAGAAA'1'1 ULXXtJl 1 ACACXXLVGGCATCAAGTGA 

5 ♦ » + 900 

GTAATGOCGGTACUllVlCl'llAAGGGGGAATGrlXTOClOCGTAGTICACT 

CATTAA CUCnUi IjG AGAAACIXIAAOGACCTGGACCCGGATGAACACCC 

+ m*± ♦ iooo 

GTAATTCCGAACACCICITTGACTIXXnXXACCKXXX^ 
GAGGAC4GTCCCACGACCAAGTTCITCITX14TCT 

1 CltXTGTCAGCGTTXnXXnTQlACAAGAOCT^ 1100 

CATGCGCAACGAACXXXXX^ACGAACAAOGCCrAGAAC^ 

+ + + + 1200 

IYTTACGCCTITXrXTCGGCXXTITCXniTnTC 

anTCCGTGAGCTCICAAa«XXXXnGTAAaX^ 

......... + . ...... ..+. ....... .+.. ...... .......+ 1300 

GCAAG<XACIXX;ACAC7ITGOCOCGGACATI^^ 



TACATCACCACAA(XACCGATTCTTXXXCT^ iTATTCACC 

+ ♦ * ♦ 1400 

ATGTACTCXntTITCGTGCTAAGACCCCCACTOGAGGTGv TATAAGTCG 

GTATTCAAAAOGGAOGCAAOGAAACAOGCAACCAGACAGGCATGGOOC 
♦ + ....» + 1500 

a*TM GTrncocraxnra3T^ 



7/^3 

AAACCAGAAAACX400CIClAOCACGACTlTrACCIX nO OCAOQDC7rrGCAA 

1501 ♦ ♦ ♦ 

TTIGGTCrrnGAZFCGCkGAJQGJGCPGAAAATGGACAGQCICGGCAACGtt 
GAACAGCCGTCX^TIGTCCGCTItXnTCITXX^ 

1601 ♦ ♦ + + . 

CTIGTaiGCkCnDUiCKGOOGAAQCAJUULCaaCG^ 
eFLPP NDAENEAKQAAEQ 
ia3tfUTBCAGO00GAA ATGTnCOt^^ 

1701 ♦ ♦ + --- 

AGCTICTAOGTOCGGCTTTACAAAODGGCAAAOGCLlAi' 1 1 L.'l CACCGGGAGCA 
e LELGFMKATQPWSEGE D 

1801 ♦ ♦ ♦ • + ♦ • 

CGGTGAGCGAOGCAAGGACGCGGTGCTaXXXnGCIGCA<^^ 
e NES RE Q A V L R V V II R E S L 

1901 + ♦ + - - • f^!^™?^?^^ 

CItnCAOCXXMCCOCCAACATCGGCAGOGACAlXXXXjVGOGA 

d 

eVTPCRWYGDSYGDSYGDS 

ciGTCxntxxxxxrcTGCGCGCTGCiGCGO^ 

3001 + + + + «■-- 

CA&OGAoaxxxxxaarccGAa^axxxx^G^ 

d I SAAfRAAAGKR SIASQ 
e HE GGWAS SRGE A L II G VP 

GOGCCACI "I GCT1 t»"I HATOCAACGCCAAATGCT V/I J IJGCCATGGCCCGGAC 

2101 ♦ ♦ + 

OCXXXnGAAOCAACACTAGGTPGCGGrTTAOC^CAAGCCCTAOCGOCOCTC 
dRSXTTIWRWISWAlAR V 
e AVQNEDLA LEQQRERPG 
Ga7nTtXXXXXnTXTOMnxr.GCATAGATC^^ 

22)1 ♦ ♦ ♦ «• + 

CGCAMGOCOa^CXXTrGAGaXTrATCTAr^^XXrOGCnrAGGCAGGTC 
d NRARCE AYIARGLGDV. 

eRKARPVRCLDRPRTRGR 
ACCIGAOXAATCGAACAGTCOCX1\ATTO^ 

2301 -• + • ♦ + + + 

TGGACTGOC1 IAGCTTGTCAOCCGTIAAGGCGGGATOGCOGCAGOCTOGGC 
d RVSDFLACNRGLPT PA A 
e QRPR VTR LEA RAA DS GG 

B.9»II.r 

CAGCTCXXXXnXXATCTCGCIGAGTGTCTAGAGATCTGAGTCG^ 
2401 + + + + + 

CTDCAGOCGGAGCTACACOC^CTC^CACATOCTAGACTCAOCrCGGrAATC 
dLEAEIESLTYLDEIILWE T 
e ARGREPQTELSRLPA MG 

S,cx,l,I C-TPCR MUTAGENOSISPHV JI0-300S 

CTCX3GGTOXXXntX*CG00CXXXXX^ 
2301 ♦ ♦ ♦ ♦ «... 

GACGCCAGOGGCACCn H.UA.lXXXXnTCOGGAAGCCGa; 1 I.LUHJUGYAC 

d RDGDVARRLGEACAA mt 
c QPRRRRRA SPRRRV 



TSACTOOOCAOiCKXXnuroCGTOtt 
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+- ♦ — — — ♦ — + ieoo 

COCTroOCGC aU JLl IOCGOOCXXaPWXKXXSACCCGnAAOOCAAT 
ITCI C CaXAOOC l CTI roC IC GCCCGATAGOOGAGIPGCITAACGGTC 

+ ♦ ♦ * 17D0 

AAGAOGGOGTG0G^G4AAGGA0C00GCTAIOCXXTC4GO6AATIGOC4C 
KAAREKRARYGLRKVTD 
OGTtXTIXLiTAGGCGa^TGOGTlra 

- + ♦ + + •-•♦1800 

XXAGCACTATCXXXXXXTACGCAAGOGaXCACCTOGGACGAGCXX^ 
DEYARI RE R R A A QEAL 
AGTOCX X nt A TiaUCOE XrTiaX XXXXJCCICAOG^ 

+ + -f - + 1900 

- CAGGCCACTAAGCTOGaX^GCaXXXXX^ 

GT IRAGEAATVRRKKR 

M.c.a.ICATG DUPLICATION pHVII0-300s 

C1GATAGCAAlCanC££IBGClGAaXXX^ 

+ +- + + + 2000 

<^TATCX7ITACGGAGGrA(XXl<aXMXXintliAACXXXKXXXXXX7^ 
• L L A SMA SA SKARRAV 
MAT GGESVR VKRAACS 
GCGCGCACTGAGT(7rcGanX^AGA<XACG^ 

♦ ♦ ♦ + -... +2100 

0GO0CG1GAC1CACAO0GGAGCATCIGGTCCTAGGGC4GGGGGGTnA 
RAS LTAEYVVI GDAWI 
ACQTEGRLGRD RGG LE 

G-APCR MUTAGENOSI5 tiwf 
P h Vl.l 0-300$ IF ~2jL 

CrcCCTGTCOW^AGCGGTCCGGGACACACXriXXTITGC^ 

+ + + + 22D0 

GAGCCACACCCCATDCCCAGCOCCnTIXTrcCAGCAACXTICCCCTXAACC 
ESDPLPGPCVDNCPFEA 
RQGTAT RSVRRQVP IR 

'11U XO GTO CG CACGTAGATCOCXIATSACGGOCGGACGATAGGCOCACA 

♦ -•* + -•---•-•-+ 2300 

CAAGGOCCAGCCClCCATCrAGGCGTACn3C0GO0CIGCTAlCCGGGlCT 
XRTP LYIR MLA PRYAWL 
EPDAPLDAEPRSSLGVV 

cmur A crnxncrociC Aaxx^^ 

♦ ■»- + +2W0 

GAAACATGCACCAGACXjlACll>LtAVl LXXXXTGOCACCGTACAAGCGCGG 
KYTTQQRWR PPPMWAG 
QVEDAS A LAAT AHERR 

CTnm^CGa^TCTIGCOCCG CJ Oiri 1 i uvic atgaggocigagtaa 
+ ------ + 25D0 

caaagggtcx^xtacaaxxxxxx^uaaa 

ewaihgpmktmlgsys 
ng l rwp rp kqdep rl l 

ATCCGAGCGGCTIA0P000CtXX3 AT inX3^ 

♦ +- + — * — - — --+2SX) 

ATAaXTOGCXX^TCraRXXXSCATM^ 

ALPKRRAYETRPVPAN 



GTIGGGCAGCXXMTCOGAOOOOOOCGGCOGAGO(^1C(XLiGOOCro^ 
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GTCIXXUiranCIVUOTXrAaUCCATC 

zxn ♦ ♦ ♦ ♦ 

CAGACtrrACCACATGCACC. TGCIIA, 1 LU'ILtAAAOCGTCCOGACAAC 
dDPNDVDVVLLNALATP 

ATCDOCrOGAGCAGATlXiaX.1 lUX^UXXX3XACTACGCX^GOCA 
2801* ♦ ♦ ♦ 

lAGGGGAOCTDGTCIAOQiCOCAAOtanOMOtXnCATOgXTCXXrr 
d G E L L DOSALPWYPLW 
• TAATCACCXXTIinATGGTOCCACACCACX^^ 

2S0h ♦ ♦ + ♦ 

d ATIAGlPCXX^CATAOCXGGCITnpaTO^ 

TVPTNDSVLE LDSI BS 

TGATGAAACAOCAOCCACAGOOC^CGlOOOOCAAOCACCnTrACCAAC 
300|.. .. .... ........... 4.. |.. + 

ACiA CTiTOrcGTcx^ roi axgiocitBa xxn TO -fl 

dSSVC AVASCCWGGTC £Q 
CltTTOOCOCOOCTACAOCXXXXrTTAGAaXXnTACAOC^^ O 

3101- + ♦ ♦ «• 

GAGACOGCCGGOiTCrrGCGGCGGAJWCTGGG^ O 



s,p,u}. 



CIt*40CaCCClTTATOG0GlAaiAATCXXXnCTCI^^ x 

3Z)h ♦ ♦ ♦ + r 

CAGTWUaXAAATaaxXXKXTT^^ g 

33Q|. mn . ... ..+.,.. 

• pCTPSOCOOTOOOOOOOOOCOOCAC^ni UTlOO00CACTCAOCAOC7ro ^ 

axscaxATGACtxwxxjcmcociaxrit^ Jr 
caraxxnAcraxxxraGAATG^ 2 

AGCCXTOXXXTITnAAGCCrcAAI 1 ICCITG1CICOGAATOCAACTCG 

3SD1- ♦ ♦ ♦ ♦ * 

TCCCGACCCGGAAAAATTXXXLICITAAAOCAACAGACGCITAGCITGAC 

ACACATGAOCAACnOSATAACGTlODCGCTOCATCIXXCTCGOCOCTT 

3fflh ♦ — ♦ ♦ ♦ + 

TGTCTALJ IlVAAGClATTCCAAGAGOOCAGCrAGACCGAGCOOGCAA 
ACAGCTi L-l L"l UI AAGOX^TCXIXXXL\CCAGA(^GCaXTCAGCGCGC [f] 

sm- ♦ ♦ ♦ ♦ ♦ ' 

TCTOGAACACACATTOtXCTAOCXXXXTOGCTUITCCCGCAG^ 



6 

(D 



0 



GOtUTAGGOGACIGTATACTCCCI 1 AACTATCCCCCATCAGAGCAGATTGT 

38D1--- — » — • ♦ ♦ ♦ • 

OCCnATCXXXTCACATATCAOOGiAATTGATACXXXXrrAGTC LL 
ACAAAATAOOCCATCAGCOC CIC I ICOG CI TO C lCOCtCA CICACFCCClG 

lOTITATCGOCTACTOaXXUGAAC^ 

TAAJACGGTIATCCACAGAATXXGCGGATAACGCAGGAAAGAACkTCrGAG 
4001 ♦ + ♦ + 

ATTATGOCAATACCTClOTACTCOCCrArf UCGI LL ' l 1 ItTTCTACACTC 

GnTITPC\TACCCrDD(^O0 D D0 C I A OCAOCATCACAAAAAT0CACBCTC 

410h ♦ ♦ ♦ ♦ ♦ 

CAAAAACGTATDOCACCCOCCCGGACT O CTOC TA CnnTTnACCrDCCA 

CXXXrroCAAG C TO.tlUUlUXXT OU J lU i l LLUACCCITXXX^ 
42D1 ♦ ♦ ♦ ♦ 

GGCa^CCITO^GGGAO^COOGAGAGCA^ 
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. 4. ....... •-+2BCD 

CAACCCCAOCJACATCXXJXXXXICATCCO^ 

M A E I YRRGLASARS QHYI 

(MGCTCX^CAAKTrCGTCCCCCJilO^^ 

♦ ♦ ♦ ♦ +2900 

CIPGACCUGTlTAAGCAGCOGCIOCACOQUnTO 

LQE FED AVMSVS PLLG EN 

'TGAOCXXSCCAaxrACAACtOCACACt^^ 

. 4.. ........ 4........ ..4..... .....4.. ....... .4.. ....... 4 3QQQ 

ACTTxaxcitraxriinTGACG^^ 

LPWGVVACEC A R G D LGS 
CAGGAGGAACACA1GU.-HJU11 HJUAGGAO.il lUJU^XGCZAAGAGCOS 
. + .........+.........+.........+...••.... + •... .....^.jigg 

GTOCI UCl ' I GTGT A OCGlGCAAACCItXTGCAAAGOOOOGGOG A TI L ' ltAX fC 
V L L P V 

CTGAATC0GCO(XnACGAGOCAG4CAGCAO0OGAACnACCXSAGCIGGIGGG 

.> ■» » ♦ *t!** "* ; "-t 3P> 

GGAOTAaXX*XXLiTCClU^'lUIl»lCCTGG 

S GACGTGAGATAOCOGCTACICiCXXriTXXl\AGGCCGA 

♦ + ♦ ♦ -----+ -+3300 

O CltXAClCTATKXXXUTGAGTGGCACXXTRXX ^ 

— ACCAGCAG G ' H» ' i ' l UL^ GCCITCCCTCXjAACTCGAGGACATDCnGCOGAAro 

J . WggTO W^O^CMQCTCBBAA 

£ TTGCTOGCACdGO C TD GA COCXnTAGAGGODCIC 

^ *■ + ♦ ♦ ♦ +3500 

> OUOCAOGTCGOOCAGCIOGGCAAlCIOOCXXSAOOODkCAAGGIGG^ 



X 



J) 
03 



0 



CITGTOCAAGGGTClAICrAOGCITAGTOCAAAGITCAAAOGAGGGGATr 



♦ -+36O0 

P CCAA(>GGTItXX^CATAGATCOCAATCAGGTriCAACl 1 IGCICOCCIA A 

< TCGGTGATGACGCTCAAAACCICra^ 

2 «. ♦ - ♦ - + + 37D0 

ACOCACTACIGOCACTnTOCACACIGTCTAOGTOGAG GO OC ICrD OQIG 
GTCAGCKXnCTIGGCGGGITnaroGti^ 

........ «4».» ......4... ...... 4. ....... .4......... 438OO 

ACTTOCOliCAAOOGOOCACAGCOOOCOGTO 

if) ACIGAGAGTCCAOCATATGOGGTCnCAAATAO^^ 

♦ ♦ ♦3CT 

TGACIlCTCAOGnXTrATACXXXl^C 

axntXOTYXTTIGGGClGaXXXM^ 

......... 4......... 4...... ...4.. ....... -t-4000 

GCCAGCC4CC4ACOCAGAlTATlTlACCAlGGTClAOOGA lyl nCClGA OT 

GAAMGOOCAGCAAAAGAOCAGGMACOGTAAAAAGGOOOOBnGCroOC 
Hi* ♦ ♦ 4- 44I0Q 

GTrrrocoGTDCi mtxxxnecnc oc ATi n illu>cgcaacgaocc 

AACTCAGACGTCXXXIAAAOOCGACAGGACrATAAAGATACXAGGCXnTr 

4 + ♦ •» » +4200 

GTTCAGTCIT3C3ICOCClTIGGGCIXnOCTCATATnO 
OGCATACgI Cia30Cr iTI OOOCriO« XSA^ 

.... 4.-. ....... 4.. ........ 4. ........ 4. ...... ..4.. ....... .44300 

GCCTATOGAGAGOOGGAAAGAGGGAA OCOC I IUAA OOPCGAAAGAGTIA 
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Mh« i AArrr 146 BamH I' 
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^.3 Mac I- 
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AlwN I 4453 
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•Aat II 3149 



•AlwN I 2687 
•ApaL I 2590 




845Ncol- 
!07IEcoRI- 

1309 Bgl II- 
1389 Sol I- 



PCR mutagenesis 
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PCR mutagenesis 
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845Ncol 
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Digest with ApaLI+Bglll. 
Isolate 3360 bp fragment 
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•ApaL I 2590 



Digest with ApaLI+ BgllL 
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•Sfll4084 
-Spl 14002 

•Hpa I 3993 
•Sal t 3984 
•Cla I 3976 
•EcoR I 3966 
•Pst I 3960 
•Pvu II 3957 
•BomH I 3952 
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FIG. I2cx 

e 
I 

GCrAGCCAACAAAGCCGACX.' 1 ll> 1 CI t-AAAATCTCTGATCTTACATTGCAC 

1 ' + ♦ •- + " ♦ + --- 

CGATUUGl 1 Ul'l LX»C1 WIAACACAGAGTTTTAGAGACTACAATCTAACCGTC 
KAN START CONDON 

ACACTAATXCAACCGCnnTA^GOCATATlCAAOOSGGAAAa 
101 + + ♦ + + .„. 

TCnCAmTGTTDTO 
20, ATCGGCTOGCGATAATCTC^^ 

TAO0DC^GCGCrATTAC\GOCCCTTAGTCCACXXnX3^A 

GTTCCCAATGATC7ITACAGATTL\GATtX7TCAGACT 

301 ♦ ♦ ♦ ♦ 

CAAOGGTTACTACAATGflCTAClCTACCAGTOGATT^ 

CATCGTTACTCACCACITXXJATCXXXX^ 
401 + + + _ + .„ 

GTACCAATGAGTGGTCACGCTAGGGGUJL'l 1 111V1 LX/1 AAGGTCCATAATCT 

CtnCCGCDGCTKraTICC A TlU/IG m 
501.... + + + + + __ 

CGACGCGGCCAAOGTAAGCrAAGGACAAAC\TTAACAGGAAAATO * 

GTIXUTSOSAOTtUTTn^I^CGAOOgTAATCQCTCX^'lUl 1 GAACAAG — 

601— ------+ + + + 

CAACTACGCTCACTAAAACTACTGCraXXTACC^ j£ 

TCACTCATGGTGAi'l '1 Ul t^CTTC ATAACCTTATTTnTJACXMGGGG AAATTA u - 
701 -+ + + + + 

AGTCAGTACCACTAAAGAGTGAACTATTGGAATAAAAACi GCTOOCXnTTAA' £ 

GGATCnTCCCATCCTATG<L\ACIXXrTCXy7rCA01 1 1 1L1CLT1CATTACA h- 

801 + + + «. + . ~ 

CCTAGAACGGTACGATACCrrcACCCACCCACn^AAAGACCAACTAATC > 

AACGTCAAAGTAAACrACGAGCTACTC\AAAAG*A*TC 9 
i ii/l I uAATAAATCCAACl'l TI GCTGAGTTCAAGGATCAGATCACCCATCTTC < 

low + + + «... 2 

AACXACTTATTTAGCrrcAAAACEACrcAA ^ 
AACHXXTIOZAaCTACAACAAAGCTCTCATCAAC^^ 

»01 + - " + • '-♦ ♦ + ... 

7TGACCAGGTGGAI U 1 1 l/l i '1 LXiAGAGTAGTTGGCACCGAGGGAGTGAAAGA 
END KAN CASSETTE If R5 BEGIN E. RAP 

CCTCACGAGGCAGACCTCACrAGTTOIACTGAGCGT^ 

1201- +• + + 

GAAGTGCTCCXTItntX^GTGATCAAGGTGACTCGCAGTCIXKX^ 

CTroCAAACAAAAAAAOCAOCCCIAOCAGOP GltXn 1 TGI 1 IGOOGG ATQiA 
1301 + ♦ ♦ + ♦ 



GAAUilTllil Hill 1 W,"llXXXUTGGTCXXX^(XAAACAAACGGCCTAGTr 
ATACCAAATA 1 1 1> 1 ULT 1 UI AGTGTAGCCGTAGTTAGGCCA CCACTTCAAGA 

1401 ♦ + , + ♦ + 

TATGGTTTA1X]ACAGCGAAGATC\CATCGGCATCAATCCGGTGGTGAA 



FIG. \2i=> 
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AAGATAAAAATATATCATCiTCAACAAXAAAACIXnCTCCTIACATAA 

• * ♦ + ♦ +100 

TICTATTITrATATAClACIACITGrTATTTTGACA 
CAGGOCGOTUTrAAATiaXlCAT^ 

♦ ♦ * + + 200 

jC7ITXXXXXX7rAATTTAAGGTlCTACCrACCACTAAATATACXXZATATT 

GGGAAGCCCCATGCGCCAGAOl III i IHGAAACATGGCAAAGGTAGC 

♦ ♦ + + + 300 

ACCCITOXXCTAOXXXnOCAACAAAG^^ 

TTTATCCCIT3TCXXUCCA1CA^^ 

+ ..4. ..4- .... + ......... + 400 

ATAOCGAGAAGGC7tXnAGIT0GTAAA4TAGGCA7YjiAAGGACrACTAC 
AGAATATCCTGA7TCAGGTGA4AAT A1 TCI lO ATGOCCrCGCA OIUlT 

♦ ♦ ♦ + 500 

TCTTATAGGACTAACTCXACrnTATAACAACrACCOGAOCC TC^ 

GCGTATnCGTCTCGCXCAGQCGCAAT^ . 

+ + «• ♦ + 600 

(XXJCATAAAGCAGAGCGAGICOKXnTAGTKlTACT 

TCTGGAAAGAAATGCATAAICI'1'1 'IXJJLATTCrCACCGGAJTCAGTGG 

♦ + ♦ + 700 

GA CCCT1 TCI 1 1 AOCTATTAG4AAAOGGTAAGACrGCOClAAGTCACC 

ATAGGT l l* IATTCATCTroCACCAClXXXMATCGCAGAOCX»ATAOCA 

♦ + " ♦ " - - - + • - + 800 

TTATCCAAGATAACTACAACCIGCTCAGCCTrAGGCT 

GAAACGGC'I'l'1 1 I CAAAAATATXX7TATTCATAATCCTGATATC AATAAA 

♦ »- + + + 900 

1 1, 1 i 1 lAXGAAAAAGTITTTATAOCATAACTATIAGGACTATACTTAT^ 

GGTTCTAACACTCGCAGAGCAITACC^ 

-tm- ♦ v.+— + 1000 

OCAACATlX»lCAO0CTPCroGrI A AT0OCSACICAAC^ 
COGACAACGCAGAO0GTTC0CItX}CAAAGGAAAAGITCAAAATCAGC 
• ♦ ♦ ♦ ♦ + 1100 

ax3tnTOxncT(xxAA<xraco^ 

GCTCGATGATGGGGaUTIC/UXXXI^^ 

.4... ..+........- + .-...-.._+... 4. 1200 

(XX^CCTACTACOXXXTAACICaX^UXATACTCAC 

• 

jGATCAAAGGA TCITCITC AG ATC CI TT n Tre 

....... 1300 

CTAGTITOCrACAAGAACTClAOCAAAAAAA GAOOO GC^TIAGAOGAC 

GAGCTACCAACrCTm 1 UJGAAGGTAACTCGCTrCAGCAGAGOGCAG 

«. ♦ ♦ ■■♦»• +H0O 

CICGATWTITGAGAAAAAGGCnU^T^ 

ACIXTIXrrAGCACCGCCTACAIACXTCGCIUIXXTAATDC^^ . 

1 + +...... ♦ --+1500 

IGAGAOlTOGTCGOCGATCnATIXSACXX^GAOGATlAGGAC^ATXXTrC 



-2-2/^3 

TCGCTCCTGCOMnCGCXIATAAGTOGTCTCTrAC^^ 

1501 + ♦ ♦ + 

AO0G4OCAOGCICAO00CrATICAGCACAGAATtX3OOCAAO^ 

CACACAGOXAGCITGGAGCGAACCACCT^ 

IfiDl ♦ ♦ ♦ * + 

GTGKntJGGGTCKAACCItXCITGCIXX^^ 

AGGTATOOGGTAAGCOGCAGGCrOtXSAACACGAGAGCGCAOGAGGCAGCT 
17m. ....... .4.--. --------- - ♦ - « + --- + • 

TCcmGca^Ticccocnm^Gcc 

TXnT^CTIX^GCCICGA 'lTI'niW^ 

1801 ♦ ♦ ♦ + * 

AGAClTUACTCGCAGCTAAAAACACTACGACCAGTCaXXXXXXnX 

GCCTTITGCTC^CATGTItnTIXXTGCGTTAT^ 

1901- ♦ + + + 

CCGAAAAOGAGTCTACAAGAAACCACXX^ATAGGGGACrAAGAC4CCTATI 

END E. hep M BEG,KI M.r*p 

IjUj* Ifc.o. N 
ACCGAGCGCAACGCGTGCGGCCGCACXX:GTGAGCCG\0^ 

2D01-- ♦ 1 ♦ + + + . 

TGGCTCCCGTITXX^CGCCGGCGTGCGCACTtX^^ 

ACGGGTCTAAGGCGGCGTGTAOGGCCGCCACAGCGGCT^^ 

2101 + + + ♦ ♦ 

TGCCCAGATITXXXXX^CATCXXX^^ 

' TGGGGGTGCTCGGCTCTCGCTGGTCTTCCACCACC^ 

2201 «■ + + ♦ ♦ 

AOCXXXACGAGpCGApVGOGACCAC^ 

TGGAGCICGTtTItXXSAVxATACACCCGTtUTrAA 

2301 + + * + + . 

AariXXAG<L\(^GCCrGGTATC7IXXXXl\CTAATTAGC^ 

GCCGCTGGCAAGCGACGATT7ITXnTXMra;C^ 

2401 ♦ •♦■ + + . 

CCXXX»ACCCTTCGCTGCTAGAACGAGCTCCtZt7TAGA 

AACCTGCITXnXXTIGGACCTACLVCC^ 

2S01 + ♦ + + + - 

TITXIACGACCAGCACCTXX^TCTGGTAGGTCTG^ 
CCAACCGCCACGG\CACGCAGTX7TGCGCACTX^CCC^ 

2601 + + t - - + + 

GGTTCOCGGTTXXrTCTGOGTCACACCXXrrc 

AGGu;r i u x xxxxxxxnx^uiTxx^ACcxxL\<nTACirAr^cix'Aix;A 

2701 + - + + + + 

TCXTGGAAGCCGCGCXXXLVGCTACCGCItKXrGTCAATGAGTC 

CTCTACACACraGO^CATCXLlGGGC^ 

2801 ♦ -'♦ ♦ + + • 

GAGATGTCTGAGTCGGTGTAGCTtXTGGClXXlAGOCGCG 

GGCGGAATTGCGC/lCrGTTCATTCOGTCAGGTT^ 

2301 + - •♦■ + + + - 

<XXXXTTAACGCGTGACAAGCTAAGGCAGTXXj^CACCCC^ 

CGCXATCTATGCCGAGTCXXIACCXT'iCGAVACGOCGAAl 1 1 XX.X ; 1 1 .CAACG 

3001 ♦ ♦ ♦ + 

GCGCrAGATACGGCTCACGClV»CGCGCrriXX^XX7rfAAAGGCACXrrrGC 

AGCATTTGGCGTTGGATCACAACCAAGTCGCGCATTTGGGCGGA ogggat 

3101 + + + + + 
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GATAGTTACXrGGATAACGCGC4GOGGTCGGGCIGAACGGCGGGTTOC7ro 

♦ ♦ * + - + 1(800 

CTATCAATKXSCCTATroOCOGTOCOCAGOOOGACnGOOOOCCAAG 
(XXTITUGCATrGAGAAAGCCCCAC^^ 

+ ♦ + + ---+ I7Q0 

itxx^cTccTAACJtrrritxsoGCTcoGAACcGc'i rcocrcTTrocGccro 

TOCAGOCXjGAAACOCCTGCTA^SCrrrATAGTOCTGICOCGrrrCGCCADC 

♦ *> + + + IffiO . 

GGTCCOCCTT1YXXX»ACCATAGAAATATCAGGACAGOCCAAAGO(X7IGG 

TKX*AAAAACCCeA<"CAA(XXXXX XnTl^ 

+ --- + ♦ + + 1900 

O L 11 1 J 1 GOGCTCGTTCCGCXXXiAAAAATCOCA\GGACOGGAAAACGAC 

CCCTATTACXXXXTTCAGTGA GCTGATACXXKHX^GCCGCAGCCGAACXJ 
H ' + ♦ + ♦ - + 2D00 

^ ggcataattxxxxiaaactcactxgactat^^ 

o 



I 



.... + «• + - - + -- + 2100 

AGCaXXX^CACACOGAGCAT^GGaXX^ 

GAAAaTTCCrrCGAAACCACGCATGTGTl't.l.'l UL'UXJl '1GGTACAGGTGCT 

----- + - ------ •- + -- -- ••••-♦••-•••-.-♦......... + 2200 

K crrrccACGAGcn i u:i vuui acacaacgaggaccaaocatgtccacca 

$ ACCCCCCGAGTXjI vXL\GTroT^XXX71XXXrCCIt^COGAAATA7^ 

X TtrxrCCCIX^CACGTXLiACAOXCAOCCGGGACr^^ ^* 

o cccnt^cccAccraxrcAa^TTixu 
anrr^TccrAr^TTxxr^ACATT^G 

W^r^CCGATCOCCCCCCT^TCTAOCTCCGCr^ 2300 
ACXXXXXGCXXXTITXCATCCCCIKXrCAAOCCCAT^^ 

CCC/UTAOXXXXXXXnAAGCCGCT^^ 

TXXXnTATTXXXDGCCCCAT^nCGCCCACCCTA 2300 
CCAAAAACCXCXGCX^CATCCCCIT-^^ 

******** *■*"**" "*"*"* ■*•--->» • - -•-•♦■•-»---. + - -------- + 2900 

ItTlTriTGCGCOCGGTtnACaXUCOCT^ 
C(XCCGCTGGa7rCACX^CACCAOGTACAMGCGGCT^^ 

+ " " + * • + ----.+.„. + 2900 

— CGGCGOGAroGCAGTITXnCTUGT^^ 

li, CCCrCATGCCGATCTAOCIt^OCACCCCGAAOCTtX^ 

♦ ♦ - + + + 3000 

GGGAGTACGOCTAGATTXIAaX~CIXXX^ 

ALUnTTlVlUXl X 'ACXXX'TACrCGACAGCGAGCTX^ 
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32)1 ♦ ♦ — + — mill' 



(XXXTIACAGCGACGGCTACAACXXXXIAGCXXI^^ 

3301- - -. + ♦ + • + *" 

GCCGATGTTXCTG<XGATC7riGGaX7ITXX^ 



GTTXnTXCGCIXXTTCGCCCAGGAACGCAGCGAG^ 

.....4. •♦■ + + - - + 

CAGCAGGCCGAGCAa^KXTIXXTIXXXTIXXXnX^CCGAGCGGCrc 

CXXXGCAAACCXXXMiACATnCGGGCK^ 
35D1-- + + ~~~1 + * n 

ccccccn rocccGi no iAAAGcccGACx7rAGAccixnxxKLv\Titxxrr -Q 

AAAGGCCCACAAaUAGCOGACAATCCAC^^ J5 

3601 + + + * 

TTTCCGGG'ltj 1 llrC 1 ' 1 UjGCTGTTAGGTGGCGACAAGA'1'1 iax; 1 1 ' rAAC 0 

li. 

CAGGTAAAAGTCCTGGTAG ACGCTAG'l 'I ' 1 1 1 (Jl til* 1 1 1 1 GGGCCATGCCT j- 

3701 +*- ♦ 

GTOCATTrTCAGGAOCATCITCGATCAAAAGAOCAAACCC^ J 

X 

GGGTTCTACG AA'l CI ' 1 lAi 1 CGATACCAAGCCATTTCCGCTO AATATCG 0 

- -- ----•- + -- -- -•--• + »•-•-•*** + ""*"•"""" + * * * * • - - - ^ 

CO^GATGCTTAGAACCAGCTATCXnTCGGTAAAGGCGACTrATAGC ^ 
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End Mm ftp 

TTGTACTIXTITGTtXT^^ 

AACATX*CAACACCACCCTA«^ 

S 

Stop Condons p Begin Transcription Terminator 

3 Frames I 
I 

GACGTAGTTAACTAC<XrrACGATCX;ATCGOCA(KX^TCAAATAAAACG 

^ 1 CTG^TC^TTCATCC^TGCT 
S 

J ? * FIG. I2B- 

1 I 1 
I II 
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FIG. l2Bb 

TOXXXnCAGOCATGCATGGAGCCATItXnATGAGOGAOGGCT^ 

— ♦ ♦ + - •-■•■3300 



+ ♦ ♦ ♦ +3400 

QCACSGC/CDfSCXSC CnOOGC ^^ 



IQCACOCCOCGSkCGICkTCOCtt 

»---•-»"» + ••*•*****♦""***•"*"♦****"""" 35QQ 

ACGtGCOCXX X TTGCG TA GGGGCGGM 
j? CrcGGCTATDCGGCGACGAAAGAGCGTGOGGCAGAACAGGLAAGCCGC^^ 

t«t a «*«««f »#■•#••••! •••••••••f cccaf +3600 

- GAGCOGATAc aj uGciu:n t uhj u caog oj u unmm iuaaj u agt 
GGAGCGGGTGrnXCGGGGGTItXGTGGG 

If + + + ♦ +3700 

U. canOXXXACAGOCXXXXCAAGGCACXXXXX^GGCAAOTltXXtAG^ 

I 

t GTCTCGTi<x xnt?rnxxnTG CGco (xnT^ 

> . + «- + ♦ + - +3800 

CAGAGCAACGCACAAAGCAACGCXXXXL^AAACITATGGTC^ 

u 
t- 

< GGGAGCTCACXXXXZAGAATCGGTCGTIXTrGGTGATGTACGTC 

2 - + + ♦ ♦ ♦ +3900 

TXXTIXX?AGTGGCGGTCTTA(XXZACCAACACCACT 

H 

1 . 

E B P E n 

DXcN vPc d CS 

r b o c m u so 1 I 

R o H I tR I I 

I I V I I I II I II 

TAAATCTAGATATOCATGGATCX^GCItX^GAA^ 

ATTTAGATCTATAGGTACCTAGGrrOCA£Xrit7rrAAGCITC^ 



AAAGGCFCAGTOCAAAGACrGGGCCn IOT 1 1 rATCTGTItjTTTGTCOGCC 

♦ ♦ ♦ ♦ ♦ +4100 

TTCCGAGTCAGCITTCrCACOOGGAAAGCAAAATAGACAACAAACAGGOOG 
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I 

PVUH397I 
BcLf-r»HI3966 
Ncol 3962 
X hoi 3953 
Dro.13947 
BolII 3942 
Konl 3936 
NotI3928 
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1 
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5pH40l6 
H pal 4007 
Sail 3996 
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HinDni3986 
EcoRI 3979 
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Ehep^ Spell 2 19 



| Notl20l7 
PCR AMPLIFY hs p 60 PROMOTER FROM pMV26l 
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T i. a,S FIG. 31c 

j GICGACCACCAAGGGCAOCATtnxnXXTItXXXXAOCX^^ 

CAGCTCGICGTIXXanGGTAGAGAGGAACeCGGI^^ 
GCGAGGGTKXTGAOCCClCCAACItXXXXyiGCAACC^^ 

+ - + 4. + + . 

CGCTCOCAAGGCTCGOGAOGTItMGGGCX!4CG1TGGAACAGGGCC\GATAA 

GCGCAGGOGGGGGGC1UTATT0GTITGTCAGCAT0GAAAGTAGOCAGATCA 

201 ♦ «. «• ♦ + 

OCCGTCCGCCCCCOCAGATAAGCAAACAGTCGTAGCTITGVTCC^ 

30J TTCCAGACOXniGGAAAGAAAAATGGa^GAG 

AACGTCTCG<X^C<jmOn 

Nde I 
I 

401 TGGGTG7C1GCCGACCACATATGGGGOGGTCAAGATAGGTTTTTACCCCCI 
....... - - + - - -- -- + ...... . + . ...... . . + 

ACCACAGACGGCIGGTGTATACOCGGCCAGTTCrATCO\AAAAATGGG<X3 
i01 TTGAAGCCTGAGA GTIXXIACAGGAGTrCXl\ACC<XGTAGCX7riX7rTCA CGAC 

AACTIXXXSACTCTCAACGTGTOCrcAACGTTCGGOCATO J3 
Bamlll = 
1 . 

AGOCCAGOGGGAGGATCCAAGOCTCATAOC»rcAACOCGCAGGACGGTGlX*A O 

601 + --- + ♦ + ... + . 

TCGCGTCGCCCIXXTAGGTItXXJAGTATGCA^ 
IlU V R 

Inl start? x 

CGCGGGCGAGAAGCGGCTCATCGAGATGGAGACCTGGACCXXTCCACAGG t- 

701 + - * ♦ + > 

CGAGCGCCOGCTCTTOGOCGAGTAGCTCTACClXriXXJACCTCGGGAGGTnT > 
Inl LA GEKRLIEMETWT P PQ 

ACGCGGAAGTCGCTCGTGGAGGGCGACCrCGCAGACGGGAGCAGGGATCrG £ 

801 ♦ ♦ ♦ * + - 9 

TGGGCCITCAOOGAGCACCTQGOGCTGGAGCGTCTOOCGTOGTOCCTAGAC < 
Int T R K W LV ER D LA D G • T R D L 2 

CGGTCACAGAGATGACGCCAGCTCIGGTGCC1 GCGTGGTGGGCCXXXIATGG 
901 + +.........+...- + ■ + . . 

GO^GTCTCTCTACIGaX7ia;AGACCACGCACGC\CCAC^ 
Iht VTEMTPA LVRAWWA GMG 

GGTGATGAACACAGCGGTOGAGGACAAGCIGATOGCAGAGAAOCCGTGCCGG 
1001 + . + + + + .. 

CCACTAlTltJ l G'llt X XAOlTLOUni ^ ^ 
IwtVMNTA VE DK L I AEN P C R 

Bgin 
I 

GAGGAGCTGGACATCGrOGOOGC TC AGATCTI^AGCACTACCGGATOGCGG 
1101 + + ♦ + - + .. 

ClXXTCXjACCTGTAGCAGCGGCGACTCTAGAAGCrCGTGATnGCCT^ 
Xnt EELDIVAAEIFEIIYRIAA 

TITXXXIGCAAGGACATOGTCGACGACGGCATCAGGATCM 
DDI + ♦ ♦ + + . 
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FIO. 31 fc> 
CAGCnXCTTZAGAGCXXnGAAXSACAGGG^ 

. ..4 + + 4 4 too ' 

CGTTX^GOt^CICiaXXaCITOCT^^ 
TCTCrrCACIKXXCCAaCIGCAJLTCrG^ 

GAGAAGlGACXnXCTCGA^SGITAGACCACACTTAC^ 

.GGGATGCcrnxAAcaaaaaaGoc^ 

--+- + 4 4 4 300 



^^r,\C^^*™aaA firm AnrrererAc^A rrrrTreAAr.A a 

T^GCTrrayorcrn^ 400 

•U P core 

AGAGlXGAanAGGAGATOA tXTn S °° 
GAGAGGAGACCIAGTKKX2IIADGT0GCGGATG^ 

GCIXnXXnXntKlATCAAGOGTGCAGCGCCTA 

Ps(I 
I 

CGTACIXOGCGClXXAGAOClXaiACAACAAG ATCr.Afmr.Airanr. 

1- 4 ---4 -----4.-.-. 4 700 

2 CCATCATGCXTCGAOGTCTCGATGC'l GT1UIUU1 ACCIGCCGCTTCGGAC 
rO YYALQTYDNKMDAEAW 

IntsUrl? 

X ACCGGGCXSAAGAAGGCAGCCGCCAGCGGCATCACGCrGGAGGAGTAC 

j- ■+■ ..+.........+... + 800 

> CCTCGCCCGC'l 1U1 1 UJt/ilJWKXKTIXXXTGGTAGTGCG ACCTCCTCATG 

> DRAKKAAA S A IT LE E Y 
TACAGCXGGCACGCXXIAGGGCCGCATCTACCCG ^ 

X -4 -4--. ...... 4. ........ 4...... ...4 y0 ° 

£ ATGTCGCOCGTGCGCXTTCGCGGOCnAGATC 

5Y SGMA ERRIYPVLGEVA 

2 GTAGGAAGCACCOGACTGCODGCCXXXIATGGCTACAAOGTOCTCOGGGC 

. ....4 . .4 .....4.. ....... 4 .....4 1000 

ATCCT1 CGTGGGC1GAOOC300GGG0G1ACGGATGTIXSCAGGAGGO00G 
RKIIPTA R RHA YNV L RA 

ATCGAGCAGAAGGCAGCCGATGAGCGOGACXnAGAGGCGCrGAOGCCT 

4 4 4 4 4 HC0 

TA«ntx7itnrcantxxxnAra 

IEQKAADERDVEALTP 

CATACATCCTGGCGTGGACGAGGCTXrCGGTTCCGAGAGCTG^ 

+ + + + 4 1200 

GTATCTAGGAOrGCACCITXnXXGAGGCCAAGCC^ 

YILAWTSLRFGELIEL 
GO0CIGCCGC1 !U JU ^4t *GGGAACAAGA ltU l U;ri G G QUCCCCAA 

........ 4---»-..--4-.-.-.--.4----»----4-------..4 MOO 



FIG. 31 A a. 



GACXXrrCOGGTOGAAGOGTCCTCr G ACGGI lUA A rit A CGTCGCGGAG 
1301 ♦ ♦ + + 

CTCGCAGGCCAGCITCCCAGCACACI^^ 
IntT VRSKR PVTV P PHVA E 



GCATTCCCJGGRGACCACGACGCAGGGCAACOGGCIGTCGAAGrOCG 
1401 + *♦ + + 

CGTAAGGACCACTtXTTGCrGCGTtrCCGTTfXXXXIACA GCTPCAGGCG 
Int A FLVTTTQC N R LSK SA 



GCATCX^CGACTTCXXXXXntTrcGGCGCrACXnTXX^ 
ISM + -•- ♦ * + 

CCTAG<7IXXnXXUGGCGOGACAGa^GATr^^GOnGCGAGlCCX7r 
Int IHDLRAVGATFAAQA 

GGCGATGAAGTACXAGATGGCGTCTGAGGCCXXXrGACGAGGCTATCGC 
1601 * + + + 

CCGCTACTrcATWTICrACCCCAGAC^^ 
Int A MKYQMA SEA RDE AlA 



COCAAr^ACACTCAGTCCTAAAGAGCCGCOl 1 lLTlUlCAGTACGCOAA 
1731 + 

GGGlilUIUlUACTCAGGAl 1 !!JlLlJLXXX^\AAnAACAGTCA^TGC<XTr 

PvuII 
I ' 

GGCACCACCCXXrCCCCCCGCCAGGACCATTXXXXnTCCa^ 
1801 + -'- «. + + + 



GGCGACTmXXKXXUaxnGAGGATXnXXATCACAGAGCC^ 
1901 * + + + + 

CCGCTCAAAGGCaxnTXXUCTCCTACAOTAGTG^ 



CTtXAGGGCXriULOGUJ 1 '1 LAXU tiAGAATACAGAGCCAGCTLLULiCl GCGCCT 
GAwj' i uJJjGAGGOCUX^CGGACTCrrAT 
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Fl O. 3IAb 



ATGAIXXX^AGCXXIACATGAAGGACOCZDIOC^GATGAACAAGGGGCXXGAG 

+ - ♦ + .+ " + +1400 

CTACTAGGCTCGCCTCTACJTOCItXXXSra^ 
MI RA II MKDHTKM NKC P E 



(XTTCACCAAGTCXXnGAAGCGTCGCE^^ 

- + ... + .........+ --- + +1SX) 

CAAGTCGTTCA GCGACTTCGCACCG ATOCCGTTXn'ACOCAGCX!GGCCTTGAGG 
FTKSLKRG YAKIGRPE LR 

5 (XTIGCGACGACCAAGGAGCIGATCXKrDGTC^^ 

co •- + + + + + - + ifioo 

• ccAcrcrcxnxxriTCxnxx^CTA 

O GATTKE LMARLGIITTPRM 
u. 

X IGAGGCGAlXTItrMGCTGGCCAAGACrTCCrc 

t + +- + 17D0 

> ACrCOCCTACACGTTCX^CXXKTrrCIGGAGGACTriXXOT 

KAMSKLAKTS 
-j- Int stop 

U 

I- GAACCACGOCTXXXXXXXL\GOtXX^CCACOG(XGC^^ 

^ + --- + ♦ ••— + + «- --+1800 

2 CTTGGTGCr»GA(XGGOCCTaKXXnCCTWXXXXX;AGACAC^ 



GTrCIX n TC TC CGCOGOCrATGTACUGCroGTOGTrC TA 

+ + +~« ♦ '+ + 1900 

GACAACtOCraXXIXOTACATCTaOG^^ 



CCXXXXnTGCGGlCAAACCIGAOUWCtt 

+ + -+ --+2000 

GOGGCCAAOGCCAGTTTGGACTGGCAGGCTGrCGCXrrGOGG 



SAII 
I 

eX^<XntXSAOGAGOOCXX;iGAia;imGGTQ G AC 

+ +- — + 2089 

AOCTa^GGCIGCIDOGGODUnAaC4C^CX^GCnG 
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